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CONFIGURAL SCORING” 


PAUL E. MEEHL 


UNIVERSITY OF MINNESOTA 


(7 of the most important words in the 
vocabulary of clinicians is the word “pat- 
tern.” We speak of ourselves as thinking in 
terms of totalities, organizations, configura- 
tions; and we congratulate ourselves for being 
able to look upon case material in this pat- 
terned, non-atomistic way. The question as to 
how much we really do this, and the further 
question of how well it pays off in terms of 
predictive efficiency, is a very complicated one 
on which neither theoretical nor adequate em- 
pirical data as yet exist. But I am sure everyone, 
even the most “hardheaded,” who is at all ac- 
quainted with clinical work will at least agree 
that there are situations in which data are in- 
tegrated in a “patterned” manner. The word 
patterning is ambiguous, and I have tried else- 
where to give a formal (mathematical) defi- 
nition of patterning of variables for the con- 
tinuous case [16]. One obvious, simple instance 
of patterning is the case in which a datum is 
taken as evidence for a certain hypothesis 
about the patient when seen in a certain con- 
text of other evidence ; whereas the same datum 
would be taken as arguing against the very 
same hypothesis with the context different. For 
example, an objective history of extremely 
wide and vigorous athletic participation in high 
school would, statistically speaking, argue in 
favor of masculinity in the male. If we find 
such a history in a male aged 35, without het- 
erosexual experience, living with his mother, 
and “sponsoring”’ boys’ clubs, we begin to think 
of the latent homoerotic component as being 
strong; and the exaggerated athletic activity 
can be viewed as defense. So viewed, it is ac- 
tually part of the evidential picture in favor of 


1Written while on a (summer session) research 
appointment granted by the University of Minne- 
sota Graduate School. 

2I am indebted to Dr. Kenneth E. Clark for 
stimulating discussion and clarification of several 
points treated in this paper. 


the hypothesis that the patient has a problem 
in the area of sexual role. We are all aware of 
the dangers involved in this kind of thinking, 
which has at times a certain “‘heads I win, tails 
you lose” flavor; but most of us would say 
that a person who cannot stand this kind of 
ambiguity in the evidence ought to get out of 
clinical psychology, because it is simply the 
case that such things as reaction formation do 
occur, and if you want to deal with human 
motivation as a subject matter you have to take 
the consequences. 

A frequently voiced objection to structured 
personality tests is couched in terms of this con- 
figural character of clinical material. It is 
pointed out, quite justly, that the score on a 
test variable such as the Multiphasic Pd is ob- 
tained by simply adding the number of items to 
get a total, with no attention being paid to the 
“meaning” and “pattern” of the responses. 
While recent clinical use of instruments such 
as the Multiphasic has taken for granted the 
emphasis upon profile patterns rather than an 
atomistic emphasis upon single scales [1, 3, 5, 
6, 7, 8, 11, 15, 18, 19, 21, 22], this refers to 
patterning of the scored variables and does not 
meet the problem at the level of determining 
the single score itself, i.e., at the item level. 

An interesting paradox which I developed 
recently during a preliminary oral examination 
seems to me relevant in this connection. I take 
the liberty of calling this a paradox because 
practically all the psychologists I have since 
asked about it give the same answer as the 
candidate, which is the wrong one. The ques- 
tion is this: Consider two dichotomously scored 
test items, which we wish to use in predicting 
a dichotomous criterion. Let us ignore sampling 
errors and confine the discussion entirely to 
parameters. Suppose that the dichotomous cri- 
terion is, say, “schizophrenic” versus “normal.” 
For the present purpose the question as to 
whether some continuous variable or set of 
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variables actually underlies such a clinical 
dichotomy is irrelevant. 

Suppose now that (in the supply) each of 
the two items has exactly 50 per cent difficulty 
within each category, so that half of the schizo- 
phrenics and half of the normals answer each 
of the items “true” and half of each group ans- 
wer each item “false.” Under these circum- 
stances any of the usual methods of item analy- 
sis (phi-coefficient, significance of difference of 
proportions, tetrachoric r, and so forth) will 
show both items to have “zero validity” for the 
criterion, and they will be eliminated. Under 
such conditions, is it possible to predict the cri- 
terion solely upon the basis of the response to 
these two items, and if so, how well could it 
theoretically be predicted? Almost everyone 
who is asked this question has the initial ten- 
dency to say that the criterion could not be 
predicted at all; and most people finally come 
out with this answer even after some paranoid 
ruminations. It is, therefore, interesting to con- 
sider an extreme case. 


TABLE 1 
INTERITEM CORRELATION AMONG NoRMALS 














Item No. 2 





T F 
Item - 50 0 5 
No. 1 F 0 50 50 
50 50 199 
¢, = +-1.00 
TABLE 2 
INTERITEM CORRELATION AMONG SCHIZOPHRENICS 
Item No. 2 
= F 
Item T 0 50 50 
No. 1 F 50 0 50 
50 50 =: 100 
¢,=—1.00 
TABLE 3 


VAipiry OF ITEM-PAIR WHEN SCORED 
CONFIGURALLY 


Normal Schiz. 








Response TT or FF 100 0 100 
Pattern TF or FT 0 100 100 
100 100 __ 200 

?; = +1.00 


Validity of each item — 0.00 


Table 1 shows the correlation between the 
two items in the first criterion category and 
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tells us that all normal persons answer the two 
items in the “same way,” although they do not 
have any tendency to answer them either true 
or false. ‘Table 2 shows us that schizophrenics, 
while also having no tendency to answer either 
item true or false, are consistent in another 
way, since they always give “opposite” answers 
to the two items. However, each item, consi- 
dered singly, has no discriminating power. 
Table 3 shows the predictive efficiency of scor- 
ing these items configurally. ‘That is, we con- 
sider the response to the two items simultane- 
ously on the scoring key rather than one at a 
time. Any person who answers the two items 
in the same way (either both true or both 
false), is predicted to be a normal ; anyone who 
answers the items in the opposite way, 
whether in the true-false or false-true pattern, 
is considered to be a schizophrenic. In this hy- 
pothetical ideal case, we find a perfect predict- 
ability of the dichotomous criterion on the basis 
of the two items, even though each item has 
zero validity. It is obvious that this represents 
simply the extreme case of situations which 
can exist in lesser degrees. 

The general condition for this kind of pair- 
wise scoring to create joint validity when each 
item singly has zero validity is that the inter- 
item correlation should be different in the two 
categories of the criterion. In the ideal case 
just treated, this disparity between the inter- 
item correlations in the two categories of the 
criterion is at a maximum, since the interitem 
phi-coeficient among the schizophrenics is 
—1.09, and the corresponding interitem phi- 
coefficient among the normals is +1.00. In a 
second artificial example, the discrepancy is 
considerably less although the interitem phi- 
coefficient does differ in sign for the two por- 
tions of the population which we wish to 
separate. 

Table 4 shows that the interitem correla- 
tion among normals is .40, whereas the inter- 


TABLE 4 
INTERITEM CORRELATION AMONG NORMALS 





Item No. 2 

= F 
Item T 35 15 50 
No. 1 F 15 35 50 


50 50 §=6100 
>, = +.40 








CONFIGURAL SCORING 











TABLE 5 
INTERITEM CORRELATION AMONG SCHIZOPHRENICS 
7 : Item No. 2 
[ F 


Item T 15 35 50 





No. 1 F 35 15 : Inve ms 
i. 
$, ——.40 
TABLE 6 
Va.ipiry oF ITEM-PAIR WHEN SCORED 
CON FIGURALLY 
i. on ji Normal | Schiz. . 
Response TT or FF 70 30 100 
Pattern TF or FT 30 . 70 fs 100 
100 =6100 _ 200 


Validity of each item — 0.00 


item correlation among schizophrenics (Table 
5) is of the same absolute value but opposite 
in sign. Under these conditions, scoring the 
items configurally so that again a true-true or 
false-false pattern of response is considered 
normal and a true-false or false-true pattern 
is considered schizophrenic, we get the fre- 
quencies of Table 6, showing that the item- 
pair, scored configurally, has a validity 
coefficient of .40. The validity coefficient 
against the dichotomous criterion ob- 
tained from configural scoring of a pair of 
items when the interitem phi-coefficients are 
located symmetrically about zero is always the 
same as the absolute value of the phi-coefh- 
cients. Thus, two items each of which has zero 
validity but which have a correlation of —.60 
in one class and +.60 in the other, would 
yield when scored configurally a validity co- 
efficient of .60 in separation of the criterion 
classes. 


Consider next the case in which, although 
still opposite in sign, the interitem phi-coeffi- 
cients are not symmetrical about zero. 


TABLE 7 
INTERITEM CORRELATION AMONG NORMALS 





Item No. 2 at 
2 3 F 
Item T 37 13 50 





a No. 1 hee ve 13 37 50 
ith : 50 50-100 
¢, = +.48 
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TABLE 8 
INTERITEM CORRELATION AMONG SCHIZOPHRENICS 


Item No. 2 

T F 
Item = 22 28 50 
No. t F 28 22 50 


100 


50 50 
$, ——.12 
TABLE 9 
VaALipiry OF ITeEM-PAIR WHEN SCORED 
CON FIGURALLY 
Normal  Schiz. 
Response TT or FF 74 44 118 
Pattern TF or FT 26 56 82 
100 100 200 
?; — +.31 


Validity of each item — 0.00 


Still holding the single item validities to zero, 
in Table 7 the interitem correlation among 
normals is .48, while the corresponding inter- 
item correlation among schizophrenics is —.12. 
Calling the T-T or F-F configurations normal 
and the I-F or F-T configurations schizo- 
phrenic, we see the validity for configural 
scoring is at .31 (Table 9). The algebraic 
difference of the phi-coefficients in the two cate- 
gories of the population is .60, so that if such 
a difference (¢,; — ¢2) had been symmetrical 
about zero, i.e., an interitem phi of .30 in the 
normals and —.30 in the schizophrenics, we 
would have a configural validity of exactly .30. 
So we see that a considerable asymmetry does 
not markedly affect the magnitude of validity 
obtained through configural scoring when the 
numbers are in this general range. 

Next consider a case which is intuitively 
somewhat less satisfying but still shows how 
the configural scoring can generate validity. 
Tables 10 and 11 represent the situation of an 
item-pair which has a very high positive associa- 
tion among normals and a positive but consider- 
ably lower association among schizophrenics. In 
this instance, for both portions of the popula- 
tion the “expected” (modal) response pattern is 
one of agreement. That is, we expect the schizo- 
phrenics as well as the normals to answer both 
items in the same way, even though we have 
no expectations as to which way they will ans- 
wer them. Under such circumstances if we score 
a I-F or F-T pattern as schizophrenic, it might 
seem somewhat unreasonable, as this is not 
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what most of the schizophrenics do. But the 
alternative leaves us without any basis for 
making predictions. In Table 12 we see the re- 
sults of scoring configurally by calling all T-T 
and F-F response patterns normal, and all T-F 
and F-T response patterns schizophrenic. This 
yields a configural validity of .37 which, it will 
be noted, is .07 higher than that obtained with 
a similar algebraic difference of the two phi- 
coefficients when they are symmetrical about 
zero. 


TABLE 10 
INTERITEM CORRELATION AMONG NORMALS 


Item No. 2 


T F 
Item I 47 3 50 
ca No. 1 F 3 47 50 
ieee 50 50 100 
>, = +.88 


TABLE 11 
INTERITEM CORRELATION AMONG SCHIZOPHRENICS 


Item No. 2 
¥ y F 
Item ‘ § 32 18 50 
No. 1 F 18 32 50 
50 50 100 
>, = -+.28 
TABLE 12 
VALiwiITy OF ITEM-PAIR WHEN SCORED 


CONFIGURALLY 


Normal  Schiz. 


Response TT or FF 94 64 158 
Pattern TF or FT 6 36 42 
100 100 200 

>; = +.37 


Validity of each item — 0.00 


Since we are only now in the process of col- 
lecting some “real” material on MMPI test 
items, it has not seemed profitable to enter into 
the mathematical properties of such systems in 
any detail. I have, however, plotted the config- 
ural validity coefficient as a function of the al- 
gebraic difference of the interitem correlations 
in the two criterion categories, plotting a 
separate such curve for various values of the 
point about which the phi-coefficients are sym- 
metrical. For points of symmetry as high as 
.60, the relationship between configural validi- 
ty and the difference (¢, — ¢2) is very close 
to linear; and as we saw for symmetry about 
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zero is precisely so. The functions, however, 
become slightly positively accelerated for points 
of symmetry other than zero, so that the es- 
timated configural validity obtained by taking 
the value 


$i — $2 
2 


is an underestimate. At the extremes this in- 
troduces a considerable error. For example, 
consider a pair of items for which the phi-co- 
efficients in the two categories to be discrim- 
inated are 1.00 and .80. The difference 
(¢, — 2) is .20, which leads to an estimate 
of about .10 for the configural validity. How- 
ever, when the point of symmetry is so far 
from zero as .90, the actual configural validity 
is approximately .23. 

I have not been able to derive an exact form- 
ula for the configural validity expressed solely 
as a function of the two phi-coefficients in- 
volved, although it is of course easy to get an 
expression for a joint (configural) validity in 
terms of the proportions in the eight cells of 
the two fourfold tables. If the technique 
should turn out to be of practical value, graph- 
ical methods would be adequate for purposes 
of selecting potentially valuable item pairs. The 
sampling characteristics of the joint phi-coeffi- 
cients have not been investigated. 

If items which behave in this way can be 
found in the field of personality and interest 
measurement, it seems to me that such a tech- 
nique would have considerable practical im- 
portance. In the first place we have the straight- 
forward argument that if validity can be 
achieved with items that have no validity con- 
sidered singly, it may be possible to develop in- 
struments of a structured type to perform func- 
tions which are not generally accessible to 
structured tests. A second important considera- 
tion is in the matter of subtlety. While advan- 
ces have been made in recent years in the di- 
rection of an increase in subtlety of structured 
personality measuring instruments and in de- 
tection of distortions [4, 6, 9, 12, 13, 17, 24], 
it must be admitted that very serious problems 
in this area still remain [2, 4, 10, 11, 20]. It is 
probable that these problems will be much 
more difficult to solve with respect to certain 
dimensions, ¢.g., paranoid components of per- 
sonality, than with respect to others, e.g., som- 
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atization components. For example, the Pa 
scale of the MMPI is a relatively weak scale 
in the sense that a very sizable subgroup of 
clinically grossly paranoid individuals are 
able somehow to evade a high score with- 
out giving themselves away on the validity 
indicators. It would seem that the possibilities 
for subtlety when configural scoring is made 
use of are greatly enhanced since it would re- 
quire much more sophistication to know in 
what way certain verbal responses would tend 
to “hang together” in variously defined groups 
of people than it would to know, in general, the 
kind of response which is “healthy” as opposed 
to “maladjusted.” 

If such items can be found in a pool such as 
that of MMPI, there might also be consider- 
able theoretical significance in that it would 
leave us with a problem of interpretation from 
the factorial standpoint. For instance, at the 
present time, the theoretical conceptualization 
of the relation between so-called “obvious” and 
“subtle” items is very obscure. Wiener [22 
has interpreted his findings on subtle-obvious 
relations in successful and unsuccessful sales- 
men by assuming that the subtle items of such 
scales as Pa reflect the paranoid tendency 
faithfully, but that those who show a very high 
subtle score contrasted with their score on the 
obvious component have learned to recognize 
their own abnormality and to avoid the more 
concrete overt expression of such tendencies. 
For this reason persons with subtle Pa higher 
than obvious seem by certain external criteria 
to be better adjusted. I am certainly not in a 
position to refute Wiener’s hypothesis on any 
data available to me; but I should like to em- 
phasize that there is an alternative interpreta- 
tion which is as meaningful psychologically 
and would lead to the same psychometric 
consequences. Consider such an item as: 
“Some people are so bossy that I feel like 
doing the opposite of what they request 
even though I know they are right” which 
is scored for Pa if answered False. This 
is an excellent example of Pa subtle, both 
in terms of its obvious content which is super- 
ficially nonparanoid, and also in terms of its 
statistical property, namely, that the answer 
scored as paranoid is the answer given by the 
majority of the unselected normals of the 
standardization population. As has been point- 
ed out in previous papers [17, 23] this item 
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appears also on the Guilford-Martin Person- 
nel Inventory where the response False is 
scored as nonparanoid (i.e., opposite to the 
MMPI scoring, derived empirically on para- 
noid psychotics). Consider three groups of indi- 
viduals who are not qualitatively distinct but 
who are clustered in three general loci on a 
paranoid dimension of some sort. Since this is 
a “zero” item, the “normal nonparanoid”’ in- 
dividual in the general population will answer 
the item in the scored direction. That is, most 
normal, nonsuspicious people will not report 
that “Some people are so bossy that I feel like 
doing the opposite of what they request even 
though I know they are right.”” The prepon- 
derance of such nonsuspicious people in the 
general population sample accounts for the fact 
that this item is scored with a “O” on the 
Pa scoring template. Consider now a second 
group of people, also well within the normal 
(nondiagnosible, nonpsychotic) population, but 
who are somewhat more suspicious, stubborn, 
and hostile toward mankind than the previous 
group. One would anticipate that such people 
would tend to give the “obvious” paranoid re- 
sponse to this item, i.e., they would tend to 
answer it True. Item derivations operating 
within the nonpsychotic range whether by an 
external criterion or by methods of internal 
consistency such as were used by Guilford, 
would assign the response “True” a scoring in 
the paranoid, suspicious, uncooperative direc- 
tion—a scoring which, operating within this 
range, may very well be the valid one. Finally, 
let us consider a full-blown diagnosable para- 
noid psychotic who answers such obvious items 
as “Someone has it in for me” (True), “I am 
sure I am being talked about” (True), “I be- 
lieve I am being plotted against” (True), and 
so on. This paranoid psychotic with reality- 
distortions of delusional proportions has set up 
in addition the pseudo-objectivity which is a 
well-known facet of many of these patients as 
we see them clinically. They sometimes surprise 
us with the cautiousness and apparent intellect- 
ual honesty with which they consider alterna- 
tives to their formulations. As is true of many 
writers on controversial subjects, they raise the 
ovponent’s objections and proceed to deal with 
them—the only trouble being that they do not 
raise the right ones. Whether this almost ped- 
antic pseudo-objectivity of fairly intact para- 
noid patients is a defense erected following 
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earlier rebuff of their ideas by others, or a re- 
action against the suspiciousness which can af- 
ford to be spread widely over the area of men- 
tation provided it does not encroach upon the 
specific delusional content, it is not important 
here to discuss. The main point is that such a 
clinically diagnosable paranoid patient begins 
to reverse the trend which was detectable as we 
moved toward the more suspicious and hostile 
end of the normal range. So that this patient 
(unlike the “normal paranoid’”’ in the general 
population) gives the same response to the Pa 
subtle item as do well-adjusted, nonparanoid 
normals, although for very different reasons. 
This kind of thinking leads one to wonder 
whether the correlational properties of test 
items are not to be separately studied with the 
clinical picture and psychodynamics constantly 
in mind, at different positions on the continu- 
um to be discriminated. It might turn out that 
people who give the response “Someone has it 
in for me” (False), and also the response 
“Some people are so bossy, etc.” (False) are 
normal, well-adjusted, nonsuspcious people. 
Those who give the response “Someone has it 
in for me” (True) and “Some people are so 
bossy, etc.” (True) are suspicious, hostile in- 
dividuals in the nonclinical range, with also 
some tendencies to plus-getting on structured 
tests [17]. And finally those (very few) per- 
sons who give the response ‘Someone has it in 
for me” (True) but on the other hand also 
give the response “Some people are so bossy, 
etc.” (False) are full-blown diagnosable para- 
noid psychotics who have moved in their illness 
to the point where they have developed the 
typical overmeticulous rigid pseudo- objectivity 
of the sick paranoid. It is obvious that the 
usual method of adding points of the Pa scale 
and similar instruments would not do justice 
to the complexity of these relationships if they 
exist. It remains to be seen whether this com- 
plexity, which everybody knows occurs in the 
interview situation and in the psychopathology 
of the patients, manifests itself sufficiently 
clearly to be useful in the verbal responses to 
the materials of a structural inventory. 

The mechanics of “searching” for such 
configural scored item-pairs present practical 
difficulties. With a relatively small pool of 
items to begin with it would be feasible to set 
up the material on punch-cards and compute 
all possible interitem phi-coefficients separately 
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for each of the two groups to be discriminated. 
Arranging the phi-coefficients in parallel col- 
umns one could simply pick those item-pairs 
which showed a large discrepancy (algebraic 
difference) between ¢; and 2. With any ap- 
preciable number of items, this procedure is 
too laborious, particularly when a priori con- 
siderations do not suffice to select “likely” pairs. 
Further, the aim of subtlety in configural 
scoring might be defeated by the selection of 
item pairs on theoretical considerations. Short 
cut methods of approaching the exact “empiri- 
cal” procedure will be developed if the pilot 
studies in progress indicate sufficient advan- 
tages in the technique to warrant its develop- 
ment. 

The pair-wise configural scoring is the simp- 
lest of a whole series of configural scoring pro- 
cedures which could eventually amount to the 
scoring of fairly large numbers of items simul- 
taneously. For example, it might be that we 
give the patient one point for hysteroid tenden- 
cies if he answers item No. 37 false, on condi- 
tion that he has answered items No. 46 and No. 
52 oppositely to each other. In other words, con- 
sidering these three items in numerical order, 
the configurations F-T-F and F-F-T would be 
scored for the trait in question; whereas the 
configurations T-T-T, T-T-F, T-F-T, T-F-F, 
F-T-T, F-F-F would receive no credit. Here 
again there are practical limitations upon the 
cumbersomeness of scoring, although the ob- 
vious ease and objectivity of scoring in struc- 
tured tests is such that if it is necessary to com- 
plicate it considerably in order to gain some of 
the advantages of projective techniques it 
would be clinically justified. 


I have treated only the special case where 
the items considered singly have zero validity. 
The question as to how much it would pay to 
employ a configural scoring of item pairs, tri- 
ads, or n-ads when the component members al- 
ready contain considerable validity is a compli- 
cated problem which I shall leave to mathemat- 
ically abler workers. The aim of the present 
note is simply to bring this kind of scoring, for 
which I propose the designation configural, to 
the attention of clinicians and test-builders so 
that its possibilities may be explored in a div- 
ersity of testing areas. 


Received July 27, 1949. 
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Sees its publication in 1939, the Wechs- 
ler-Bellevue Intelligence Scale [16] has 
received extensive clinical use, and many 
papers concerning it have appeared in the liter- 
ature. Considerable emphasis has been placed 
on problems of psychiatric diagnosis, on at- 
tempts to prepare shorter scales, and on com- 
parisons with other tests. The literature on 
the reliability of the several scales and their 
component subtests is, on the other hand, de- 
cidedly limited. Except for the studies reported 
by Wechsler [16] as part of the standardiza- 
tion procedure and Canter’s [1] control popu- 
lation in his study of multiple sclerosis, the few 
reliability studies undertaken [7, 13, 14] have 
used neurotic or psychotic populations. 


THE PROBLEM OF INTELLIGENCE TEST 
RELIABILITY 


The commonly acceptable methods of deter- 
mining test reliability are the split-half, alter- 
nate-form, and test-retest techniques. The seri- 
ous limitation of the split-half technique is its 
failure to adequately indicate the dependability 
of the instrument over a period of time because 
the results show only the consistency of differ- 
entiation between individuals for a single ad- 
ministration. This method also requires a 
lengthy test, since dividing the test in half may 
shorten it to the extent that the estimate of re- 
liability becomes excessively subject to chance 
fluctuations. While the Spearman-Brown pro- 
phecy formula can be used to estimate what the 
reliability coefficient would be if each half of 
the test were of double length, it can be em- 
ployed only when the two halves demonstrate 
equal variability. 

The method of determining reliability by 
the use of alternate forms demands that their 
equivalency be established upon the same sam- 
ple. Forms are usually considered equivalent 
if they yield similar means, standard deviations, 
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and distributions. The thinking here, however, 
becomes circular in that the tests must be reli- 
able to be equivalent and equivalent to be re- 
liable. 


The method for investigating reliability by 
test-retest determines if a person’s relative pos- 
ition in a sample remains the same over a per- 
iod of time. Practice effect may cause the score 
on the retest to be higher than on the first ad- 
ministration but the consistency of practice ef- 
fect can be noted. When short periods of time 
occur between retests familiarity with the test- 
ing procedures and the recall of responses given 
on the first administration may yield a some- 
what spuriously high correlation. This prob- 
lem can be dealt with by making retest meas- 
urements at varying time intervals. Inasmuch 
as interest is usually in the stability of an in- 
dividual’s score over varying periods of time, 
the test-retest technique should be the reliabili- 
ty method of first choice. 


SPLIT-HALF RELIABILITY STUDIES 


Wechsler [16, p. 133] used a variation of 
the split-half technique in his standardization. 
He chose four subtests—Digit Span, Picture 
Completion, Information, and Block Design— 
and correlated them with four other subtests 
—Arithmetic, Comprehension, Picture Ar- 
rangement, and Digit Symbol. The correlation 
for 355 subjects, ages 20-34, was .90 after cor- 
rection for attenuation. The selection of sub- 
tests, however, seems to have been made to 
achieve maximal correlations. Subtests such as 
Object Assembly, whose correlations with 
other subtests of the scales were low, were 
omitted. A comparison of the Verbal IQ’s and 
Performance IQ’s for this same population 
yielded a correlation coefficient of .83 after 
correction for attenuation. 

Actually, these are not reliability studies at 
all. The most valid inference to be made from 
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Wechsler’s report is that the two chosen 
groups of subtests measure some common fac- 
tor to a high degree. His correlations do not 
demonstrate the consistency with which they 
do so. The correlation between the Verbal and 
Performance IQ’s may similarly be taken as 
the degree to which they measure a common 
factor, not as a measure of their dependability. 

If Wechsler’s definition of reliability were 
to be used, the several studies of abbreviated 
Wechsler-Bellevue scales [2, 4, 6, 8, 9, 10, 12, 
15] would be pertinent. These studies do not 
purport to be reliability investigations, but at- 
tempt to select subtests of the scales which can 
be combined to measure intelligence to the 
same degree as the total scales. 

A satisfactory split-half reliability study 
with the Wechsler-Bellevue is impractical for 
several reasons. There are too few items in each 
subtest to permit an adequate division into two 
halves. The heterogeneity of the subtests re- 
quires that the scale be divided by items rather 
than by subtests, if the concept of reliability is 
to have its usual meaning. 


ALTERNATE FORM RELIABILITY STUDIES 


There is no true alternate form of the 
Wechsler-Bellevue Scales although there is a 
second form [17] which Wechsler proposes as 
an alternate. The information Wechsler gives 
concerning its standardization is very limited, 
but the usual procedure of using the same pop- 
ulation for the standardization of the alternate 
as for the original test was not followed. Un- 
less the means, standard deviations, and distri- 
butions obtained for both forms are shown to 
be equal on the same population, a second form 
of a test cannot be considered an alternate. Be- 
cause Form ITI does not meet these criteria, the 
findings of investigations using both forms are 
not acceptable as evidence of reliability but 
serve only as a comparison between two simi- 
larly conceived but distinct tests. 

Gibby [5] tested 32 psychoneurotic patients 
with both forms of the Wechsler-Bellevue. 
The ages of these subjects ranged from 20 to 
45 but 75 per cent were between 20 and 30. 
He gave both tests at one administration and 
altered his presentation of forms. Comprehen- 
sion and Arithmetic were found to be signifi- 
cantly more difficult on Form II than on Form 
I. The correlation between Verbal IQ’s and 
Performance IQ’s was moderate, and that be- 
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tween the Full Scale I1Q’s high. The subtest 
and scale correlations are reported in this art- 
icle in Table 5, for the sake of completeness, 
although they are not indices of reliability in 
the usual sense. 


TEST-RETEST RELIABILITY STUDIES 


Test-retest studies not only provide some 
basis for determining the expected consistency 
of an instrument but permit an estimate of 
practice effect. In Wechsler’s report [16, p. 
133] on reliability, the only genuinely pertin- 
ent study is the correlation between retests on 
32 children ages 10-13, and 20 adults ages 20- 
34. The rank-difference method of correlation 
rather than the product-moment method was 
used. Rho was .94 for each group. The groups 
were very small, however, and there was no 
temporal control of practice effect. The individ- 
uals were retested at intervals which varied 
from one month to one year. There is no indi- 
cation as to the range of IQ’s represented nor 
as to the “normalness” of the subjects. Fur- 
ther, no report on the subtest reliabilities is 
included. 

Rabin [13, 14] reported two studies on test- 
retest with the Wechsler-Bellevue. In one 
[14], he gives the test-retest correlation on 60 
state hospitalized subjects of a wide age range 
and with miscellaneous diagnoses. The sub- 
jects were retested at intervals varying from 2 
months to 27.8 months with a mean interval 
of 13.45 months. The correlations are reported 
in this paper in Table 5. 

A second study by Rabin [13] compared 
test-retest results of 30 schizophrenics with 30 
nonschizophrenic state hospitalized subjects 
with miscellaneous diagnoses including manic- 
depressive psychosis and psychoneurosis. The 
mean age of the schizophrenics was 28.1 years, 
as compared with a mean of 38.0 for the non- 
schizophrenics. The interval between retests 
ranged from 1 to 35 months with a mean in- 
terval of 13.07 months. The very marked dif- 
ferences between the results on schizophrenics 
and nonschizophrenics can be noted by refer- 
ence to Table 5. The correlations between 
test-retest results for the mentally ill is lowest 
for schizophrenics. There are such consider- 
able fluctuations that the results of the test for 
schizophrenics seem almost meaningless. There 
are, however, marked discrepancies between 
the findings on the schizophrenic population 
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in each study. Correlations are much higher 
for nonschizophrenics. 

Hamister’s findings [7], however, do not 
confirm Rabin. His study found no significant 
differences in subtest and scale reliabilities be- 
tween a group of 34 schizophrenics, and a com- 
bined neuropsychiatric population of 53 sub- 
jects including the 34 schizophrenics. The simi- 
larity between the reliability coefficients ob- 
tained with these two groups is largely spur- 
ious, however, since there is the great overlap- 
ping of subjects between the two groups. The 
34 schizophrenics had a mean age of 29.19 and 
the combined neuropsychiatric subjects had a 
mean age of 29.26. The age range of the com- 
bined group was from 18 through 40. Thirty- 
three patients were retested after a one-week 
interval and 20 were retested after a one- 
month interval. Because there were no signifi- 
cant differences between the means of the one- 
week and one-month group they were combined 
in the computation of correlation coefficients. 
The subtest and scale correlations are report- 
ed in Table 5. 

Canter’s investigation [1] of direct and in- 
direct measures of psychological deficit includ- 
ed a test-retest study with the Wechsler-Belle- 
vue on an experimental group of 47 patients 
with multiple sclerosis and a control group of 
38 normal subjects. The age range of the 
sclerotics was from 20.5 to 47.2 with a mean 
age of 32.09. The age range of the normals 
was from 21.5 to 49.6 with a mean age of 
31.24. All subjects were retested after a six- 
month interval so that temporal practice ef- 
fect was held constant. His study was the 
first to make available any sizable, controlled 
sample for subtest reliability with a normal 
population. The findings on Canter’s normal 
population have been incorporated into the 
present study. The correlations for the sclerot- 
ics are reported in Table 5. 


PROBLEM 


It is evident from a survey of the literature 
that the most fruitful study of reliability is a 
test-retest experiment with a large sample of 
normal subjects. No such study has been re- 
ported previously. A study of subtest reliability 
is especially important because of the use of 
deviant subtest scores as pathognomonics. Stud- 
ies of the reliabilities of the various subtests that 
have been reported are for populations of neu- 
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ropsychiatric subjects only. Finally, the effect 
on the reliability of varying lengths of time be- 
fore retesting can give an indication of the sta- 
bility of the test. The hypotheses raised are: 

1. The Wechsler-Bellevue Scales have suf- 
ficiently high reliability to be considered diag- 
nostically dependable. 

2. The Wechsler-Bellevue subtests have 
sufficiently high reliability to be used for dif- 
ferential diagnosis by the deviant-score-sign 
technique. 

3. The Wechsler-Bellevue Scales and sub- 
tests show no significant differences in reliabili- 
ty which are related to length of time before 
retesting. 


PROCEDURE 


Three groups of subjects were tested and re- 
tested for this study. Thirty-eight subjects 
were retested with a six-month interval be- 
tween tests. This interval was within one week 
of the actual six-month date. Sixty subjects 
were retested with a four-week interval be- 
tween tests. This interval was within one day 
of the actual four-week date. Sixty subjects 
were retested with a one-week interval be- 
tween tests. This interval was on the exact 
one-week date. The six-month retest group 
ranged in age from 21.5 to 49.6 with a mean 
age of 31.24. The four-week and one-week 
groups fall in the age range 20 to 34, which 
was the age range of the group Wechsler 
selected for the determination of his weighted 
scores. The mean age for the four-week group 
was 25.80. The mean age for the one week- 
group was 25.46. All subjects were white as in 
the standardization population. The education- 


TABLE 1 
EDUCATIONAL LEVEL OF SUBJECTS IN PRESENT 
STUDY AND IN WECHSLER’s STAND- 
ARDIZATION POPULATION 
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Present Study Wechsler 








Category % % 
College zraduates  ............ 10.00 5.10 
Some college work ............ 14.17 3.77 
High school graduate only 60.83 10.81 
Some high school work .... 11.67 18.76 
Elementary school 
graduate only ................ 3.33 28.85 
Some elementary 
Gmesl wort ....1... 0.00 30.17 
ee ere - 0.00 2.55 
Coy or 100.00 100.01 
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TABLE 2 
THe MEANS AND STANDARD DEVIATIONS FOR THE THREE Groups OF NoRMAL SUBJECTS AND THE 
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0.6 
-+- 0.4 


One-Week (N = 60) 


14,1 
12.9 


M2 


Mi 
13.5 





Subtests 


A ae 
Comprehension........................... 
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Sresgen222 222! al background of the subjects is higher than in 
$tT FTT4 +° 1] Wechsler’s sample. It is reported and compared 
S2582823% 22) with Wechsler’s data in Table 1. The Full 
a Scale IQ range is normal and above normal. 
Sciaieisiesiee: s2a| The mean IQ for the six-month group on the 
¥ first test was 110.1. The mean IQ for the four- 
ane®eeccoe «=| week group on the first test was 118.1. The 
++4e7074 FF + | mean rage hig oe ae group on the first 
eaoctane oem eg e| test was .». Greater variation upon retest 
SeSaaasea £ =| is expected of I1Q’s above the mean than of 
‘ee 
$93127997 9]il eg aean pcan oe 
os "| tion ay is — than ps pre ‘ 
All the six-month tests and retests were ad- 
PO Re a aoe | ministered and scored by one of the authors 
SSSS6SS966 ¢--| and checked by another psychologist. Each four 
ide. ' ' "| sweek and one-week test and retest was ad- 
Sddddadadas Sas | ministered and scored by the same examiner. 
i ee ee | These examiners were completing graduate 
SAddddde as = | training in te hive —— em- 
| phasized a strict methodology of administration 
ee ee ee be > | [3] following the test manuals explicitly. The 
t++++++++ ++] research tests were given late in the training 
aeveaneee cs e-| period, after each examiner had received con- 
niss<222=2 SS) siderable supervised experience. Each test was 
annachawe med checked and rescored by one of the authors. 
S2sgacsseee 235 | All subjects were normal on the basis of hav- 
= | ing noclinically manifest neurosis or psychosis. 
ea | None were under medical or psychological 
onanaqanese - 0 | treatment at the time of testing. 
coo oo cooc So So oc so} 
++ + | | wip RESULTS 
dagadaddad eaal Practice effect on the subtests can be observ- 
ae Pee ‘ Ss | ed in a comparison of means from test to retest. 
cimcicicicie ciel cic! The differences between the means as well as 
the means and standard deviations of the 
SSS222223% 22%) weighted scores are reported in Table 2. 
t++++++++ +++] Every mean increased significantly in the re- 
~teeannee aac test. The increases ranged from .3 weighted 
SASRRSSAR FA 5 | score units for Comprehension in the four- 
eee ee week group and Vocabulary in the six-month 
SH2SSS848 SEs | group, to 2.8 weighted score units for Picture 
“| Arrangement in the one-week group. There 
| was less difference between test and retest 
Lic) Bea ! | {| means in the six-month group than was found 
tidd ; ae i | i] in the four-week or one-week groups with the 
Brae Li | ij ‘| exception of Comprehension. This may reflect 
aan : ¢ ii | | {| either the lower average IQ of the six-month 
1] i | 23 iZ | ig i| group or the diminution of practice effect with 
istpees b3.i3 j| 3 longer time interval before retest. The latter 
Bs 3 <4 ° : g< 5 E 2 i | point is also suggested by the tendency of the 
23 z 4 ga4 g 2 ®52"| four-week group to show less difference be- 
Ataseane # s & &| tween means than the one-week group. 
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Those subtests which would seem most sus- 
ceptible to simple learning showed the greatest 
gain in mean score. These were Digit Span, 
Arithmetic, Picture Arrangement, Block De- 
sign, Object Assembly, and Digit Symbol. The 
scores on all but Digit Span are especially sus- 
ceptible to practice effect because time bonuses 
are allowed. 


The standard deviations ranged in size from 
a low of 1.6 to a high of 3.6 weighted score 
points on the subtests. The standard deviations 
for the scales ranged from 9.1 to 12.6 IQ 
points. There were small differences in variabil- 
ity between the test and retest groups. Of the 
33 subtest standard deviations only 4 showed a 
difference of more than .5. Differences ranged 
from 0 to 1.1. Above a .5 difference were the 
one-week Picture Arrangement with .9, the 
four-week Object Assembly with .6, and the 
six-month Arithmetic and Picture Completion 
with .7 and 1.1 respectively. The differences 
for the scales on the one-week and six-month 
groups ranged from 0 to .3. Greater differences 
in variability occurred in the four-week group 
with a range of .9 to 1.6. 


Although most of these differences are small, 
even minor shifts become statistically signifi- 
cant when a high relation exists between meas- 
ures as is true in a test-retest experiment. In 
order to test this hypothesis, the 7’-test for the 
significance of the difference between coeff- 
cients of relative variation [11, pp. 418-419] 
was applied. Since the samples were not inde- 
pendent, the use of the correlational term in 
the formula of the standard error became nec- 
essary. This decreased the size of the standard 
error so that even small differences between 
statistics became significant. When this T-test 
was applied, every subtest and scale showed a 
significant difference upon retest. 


The Pearson product-moment correlation 
coefficients for each subtest and scale for each 
test-retest period were computed. For the sub- 
tests, weighted scores are compared. For the 
scales, IQ’s are compared. The coefficients 
range downward from a high of .91 for both 
the Vocabulary subtest and the Full Scale 1Q 
in the one-week retest as well as a .91 for the 
Full Scale 1Q on the six-month retest. The 
lowest coefficient is .53 for Arithmetic in the 
four-week retest. The scales all showed test- 
retest coefficients above .83. The Verbal Scale 
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was .84 at each retest interval. The Perform- 
ance Scale varied from .83 to .88. The Full 
Scale varied from .87 to .91. 


TABLE 3 


THe SIGNIFICANCE OF THE DIFFERENCES* BETWEEN 
THE COEFFICIENTS OF CORRELATION OBTAINED 
witH Turee Groups oF NorMAL SUBJECTS 
ReTesTtep AT One-Weexk, Four-WeEEK, 
AND S1tx-MontTH INTERVALS 


Subtests T Valuest N 
Information “is 58 129 
Comprehension scsi : 68 98 
Digit Span ......... ciaiehainsta didi 50 98 
Arithmetic 1.64 98 
Similarities 1.16 120 
Vocabulary LY 1.54 98 
Picture Arrangement .................. .79 120 
Picture Completion 84 120 
Block Design cdassebiacibals 1.50 98 
Object Assembly eee 1.05 129 
i ns 1.18 98 
Scales 
Verbal IQ ele 0.00 120 
Performance IQ .............. 1.00 120 
Full Seale IQ 1.05 129 








*Determined for the differences between the two co- 
efficients representing the extreme discrepancy among 
the three groups for each test. 


+No T value significant below 5% level. 
TABLE 4 

AVERAGE Test-ReTesT COEFFICIENTS, AND STANDARD 

ErrorRS OF MEASUREMENT FOR THE WECHSLER- 

BELLEVUE SUBTESTS AND SCALES 
(N == 158) 
Average 

Subtests* Correlationt SE,+ 
Information 86 68 
Comprehension . ; .74 1.21 
Digit Span th, 30 eae .67 1.68 
Arithmetic sndibbaiaiitiaNinatiia ' 62 2.06 
Similarities oP aT Age ae 71 1.22 
Vocabulary .............. ee iieiein 88 a 
Picture Arrangement .............. .64 1.82 
Picture Completion ..... me 83 95 
Block Design anal 84 1.10 
Object Assembly 2.0... 69 1.31 
Digit Symbol ..... — .80 1.06 

Scales* 
Verbal IQ Sainanintaia tec 84 3.96 
Performance IQ ........................ 86 4.49 
£3 ae .90 3.29 








*Standard errors are in terms of weighted score 
points for the subtests, and IQ points for the scales. 

+Average of the one-week, four-week, and six-month 
retests, by converting obtained coefficients into Fisher’s 
z functions, and reconverting the averages so deter- 
mined. 

tBased on the average correlation coefficients given 
in the preceding column. 
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In order to test the significance of the dif- 
ferences between coefficients obtained at the 
several retest intervals, the T-test was applied. 
The two most disparate coefficients for each 
subtest and scale were converted to Fisher’s z. 
The T values are reported in Table 3. None 
proved to be statistically significant below the 
5 per cent level. 

Since the time between retests was demon- 
strated not to affect significantly the size of the 
reliability coefficients, a more useful overall co- 
efficient was determined for normal subjects by 
combining the results for all 158 subjects in 
this research. This was done by the conversion 
of all coefficients for each subtest and scale at 
each retest interval to Fisher’s z, averaging z’s 
and reconverting the averages to coefficients. 
The average test-retest coefficients are reported 
in Table 4. The most stable subtests appear 
to be Vocabulary with .88, Information with 
.86, Block Design with .84, Picture Comple- 
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least stable subtests appear to be Object As- 
sembly with .69, Digit Span with .67, Picture 
Arrangement with .64, and Arithmetic with 
.62. Usually a coefficient of .70 or lower is 
considered unsatisfactory for individual predic- 
tion. The indications are therefore that the lat- 
ter subtests must be used with extreme caution 
when intratest clinical analysis is being under- 
taken on the basis of the scores on these sub- 
tests. The Full Scale is quite satisfactory with 
.90. The Verbal Scale with .84 and the Per- 
formance Scale with .86 are also minimally 
useful for individual diagnostic purposes if 
used with caution. 

As a further measure of reliability and as a 
clinically useful point of reference, the stand- 
ard errors of measurement were computed for 
each subtest and scale. The average standard 
errors of measurement for the combined popu- 
lation of 158 subjects are given in Table 4. The 
standard errors of the subtests are given in 


tion with .83, and Digit Symbol with .80. The 


terms of weighted score points. They vary from 




















TABLE 5 
Tue Test-Retest RELIABILITY OF THE WECHSLER-BeELLEVUE REPORTED 
IN THE LITERATURE AND IN THE PRESENT STUDY 
a — Aborn, 
Aborn Aborn Derner, 
Author *Gibby Rabin Hamister Rabin Hamister Rabin Canter and and Canter and 
[5] [13] [7] [14] 7) [13] [1] Derner Derner Canter 
ow - : * 
Be 3e Ag iS 
bs & ¢ . C8 a 3 ws & 
retin 3 os og ad ad 28 48 Gk 8 
=> n& 5 @ & 3S € ae = a” a4 ~ 
be )0O EE 6OGE 6B? Eee ss EG EG & &F 
he 25 Oe Ben #80 Su. Ze z= Z= z 
a ee, > = geo h e 34 60 ae) ae |) o 38 168 
Subtests Coefficients of Reiiability 
Information............ .56 -89 94 -99 -94 89 81 87 .84 87 .86 
Comprehens.on...... -20 12 -78 44 .76 62 -76 70 74 17 74 
Digit Span.............. -65 -62 .63 on .59 75 -73 .67 .70 64 .67 
Arithmetic.............. 7 75 87 .68 87 -73 74 57 53 74 62 
Similarities............ oti .38 84 -62 .86 ot .93 .65 7 70 71 
Vocabulary............. -93 -90 -90 .90 91 88 83 88 
Picture 
Arrangement...... -49 54 78 60 73 73 -86 68 59 66 64 
Picture 
Completion.......... -87 -82 .68 89 -T1 .80 7 .80 85 82 83 
Block Design.......... 87 71 67 -65 71 7 82 88 84 78 B4 
Object Assembly... .53 31 62 .67 66 o & 73 .62 .73 .69 
Digit Symbol.......... 81 .34 -79 -74 -80 91 -90 -78 -76 85 -80 
Scales 
Verbal IQ............... - .76 -78 91 89 91 87 84 84 84 84 B4 
Performance IQ..... 82 52 80 76 83 94 .90 88 83 87 86 
Full Seale IQ........... 87 -55 84 84 87 .89 -90 91 .87 91 .90 





*Comparison of Forms I and II, not test-retest. 
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.68 for Information to 2.06 for Arithmetic. 
Eight of the subtests have standard errors of 
less than 1.50 and only three of these have 
standard errors of less than 1.00. The Per- 
formance Scale is the least dependable scale 
with a standard error of 4.49. The Full Scale 
is most dependable with a standard error of 
3.29. The Verbal Scale has a standard error of 
3.96. 

The obtained coefficients for the one-week, 
four-week, and six-month intervals and the av- 
erage for the combined group are given in 
Table 5. The Table also lists all other re- 
liability coefficients on the Wechsler-Bellevue 
reported in the literature. There are many 
noticeable discrepancies between the coefh- 
cients obtained for the normal and abnormal 
populations. The significant differences be- 
tween the results of similar abnormal popu- 
lations as contrasted with the insignificant dif- 
ferences for the three normal populations sug- 
gests that the reliability of this test is consider- 
ably lower with abnormal subjects where the 
factor of the psychopathy is introduced. 


SUMMARY 


Various methods of determining the reliabil- 
ity of intelligence tests have been examined 
and evaluated. Inasmuch as split-half reliability 
is of questionable value, particularly where the 
splitting is done by subtests in a scale as heter- 
ogeneous as the Wechsler-Bellevue, and no 
truly alternate form of this test exists (al- 
though a similar test is available in Form II), 
the test-retest method would seem to be the 
most desirable reliability technique. The few 
Wechsler-Bellevue reliability studies available 
in the literature have employed abnormal sub- 
jects almost exclusively, with little attention 
paid to reliability obtained with normal popu- 
lations. 

A normal population of 158 subjects was 
tested and retested with the Wechsler-Bellevue 
Scales. The interval between retests was care- 
fully controlled with 60 subjects retested at 
the end of one week, 60 subjects retested at 
the end of four weeks, and 38 subjects retested 
after a period of six months. 

The mean score increased for every subtest 
and scale regardless of the time between re- 
tests. These increases ranged from .3 weight- 
ed score points for Comprehension in the four- 
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week retest to 2.8 weighted score points for 
Picture Arrangement in the one-week retest. 
There seems to be a lessening of practice effect 
when the interval before retest is greater. 
Practice effect is greatest on those subtests 
which measure simple learning and on those 
in which a time factor contributes to the score. 

The standard errors of measurement for the 
combined population of 158 subjects ranged 
from .68 to 2.06 weighted score points for the 
subtests. The standard error for the Verbal 
IQ is 3.96, for the Performance IQ 4.49, and 
3.29 for the Full Scale IQ. 

There were no significant differences be- 
tween product-moment coefficients at any of the 
retest intervals. The lapse of time before re- 
testing evidently does not affect the reliability, 
and the three groups of normal subjects were 
combined to yield a more useful overall coeffi- 
cient for each subtest and scale. The r’s ranged 
from .62 to .90. Arithmetic, Picture Arrange- 
ment, and Digit Span and Object Assembly are 
in the .60’s. Similarities and Comprehension 
are in the .70’s. Digit Symbol, Picture Comple- 
tion, Block Design, Information, and Vocabu- 
lary are in the .80’s. The average coefficient 
for the Verbal Scale is .84, for the Perform- 
ance Scale .86, and for the Full Scale .90. 

In the interests of maximum clinical utility, 
a summary table contains a complete review of 
all reliability studies reported in the literature 
on the Wechsler-Bellevue Scales along with 
the results of this study. It seems evident from 
this study and from the summarizations of 
other studies that the clinician must use cau- 
tion in interpreting the meaning of subtest de- 
viations. In many cases, the degree of reliabili- 
ty is too low to warrant more than the most 
tenuous hypotheses. The scales, however, and 
particularly the Full Scale, can be considered 
diagnostically dependable with normal sub- 
jects. 
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THE DIAGNOSTIC SIGNIFICANCE OF AN OBJECTIVE 
MEASURE OF INTRATEST SCATTER ON THE 
WECHSLER-BELLEVUE INTELLIGENCE SCALE 


JULES D. HOLZBERG anp MAURICE A. DEANE 


CONNECTICUT STATE HOSPITAL 


CATTER, or the manner in which a testee 
distributes successes and failures on an 
intelligence test, has long been used by psychol- 
ogists in diagnostic studies. Although scatter 
had been used very early to describe aberra- 
tions in intellectual functioning on such tests 
as the Stanford-Binet, it received its greatest 
impetus as a diagnostic technique from the 
publication of the Wechsler-Bellevue Intelli- 
gence Scale [7] and especially from the work 
of Rapaport [6]. However, most attention to 
scatter on the Wechsler-Bellevue has been di- 
rected to the problem of scatter between sub- 
tests of this scale, although clinicians have uti- 
lized scatter within a subtest qualitatively to 
arrive at diagnostic conclusions [1, p. 29]. The 
purpose of the present study was to determine 
whether scatter within subtests of the Wechs- 
ler-Bellevue can be quantitatively expressed 
and whether such a measure validly contributes 
to the diagnostic understanding of psychiatric 
patients. 


For the purpose of this study, “intratest 
scatter” is used to designate scatter within a 
subtest as opposed to “‘intertest scatter” which 
is now in common clinical usage in the analysis 


of relationship between subtests. 


Several reviews of the literature on scatter 
have been published. The present paper will 
not attempt any such review, but will refer 
only to work of specific relationship to this 
study. In their review of quantitative measures 
of scatter on the Stanford-Binet, Harris and 
Shakow [2, 3] found no clear evidence that 
such quantitative measures were valid, al- 
though they felt that quantitative scatter be- 
yond certain limits might be of pathological 
significance. Hunt and Cofer [4] are in essen- 
tial agreement that scatter on the Stanford- 
Binet appeared to have little clinical signifi- 
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cance. Nevertheless, as Mayman [5] indicates, 
there still remained the clinical impression 
that scatter on the Stanford-Binet was of diag- 
nostic importance. With publication of the 
Wechsler-Bellevue Intelligence Scale, the use 
of scatter as a diagnostic technique advanced 
and there evolved clinical and statistical evi- 
dence that intertest scatter on this scale could 
discriminate between various psychiatric groups 
[6, 7]. 

Intratest scatter, however, has received little 
research attention although it is used frequent- 
ly in current clinical practice. Although scatter 
on. the Stanford-Binet resembles intratest 
scatter as defined in this study, it differs in 
that it does not measure scatter within a specific 
psychological function. 


Rapaport has attempted the use of intratest 
scatter on the Wechsler-Bellevue in terms of 
its diagnostic significance. In his work, he 
classifies the items of several subtests into easy 
and hard items and attempts to relate failure 
on easy items with concurrent success on hard 
items to various psychiatric disorders. On sev- 
eral subtests, he rearranges the items to con- 
form to an order of difficulty suggested by his 
data, and as a result, offers an order of diffi- 
culty differing from that of Wechsler. Such a 
rearrangement, while unquestionably useful if 
valid, introduces difficulty in the clinician’s an- 
alysis of intratest scatter if for no other reason 
than the practical one that the test blanks 
which he uses to examine scatter conform to 
Wechsler’s order. In the present study, no at- 
tempt was made to rearrange the items. The 
arrangement of items as currently used by 
clinicians served as the basis for determining 
whether intratest scatter could be measured 
and whether such measures discriminated be- 
tween various psychiatric groups. 


DIAGNOSTIC SIGNIFICANCE OF INTRATEST SCATTER 


PROCEDURE 
In order to devise a satisfactory measure of 
intratest scatter, a workable definition of in- 
tratest scatter had to be established. To attain 
this, six members of the psychology staff of the 
Connecticut State Hospital were given four 
lists of scores on a hypothetical test whose 
items were arranged in increasing order of 
difficulty, and they were asked to arrange the 
four lists in order of the extent of intratest 
scatter. They were then questioned as to the 
rationale for the order they had given. The 

four scatters are listed below: 


Item Scatter1 Scatter2 Scatter3 Scatter4 

1 1 om» — 

2 _ on — de 

3 - rs _ _- 

4 som es : 

5 me rs = oo 

6 = a - 

7 in is he on 

g on — on — 

9 + + + + 


Considerable disagreement was found among 
the six raters and subsequent discussion with 
other clinicians has confirmed that psycholo- 
gists differ in their evaluation of intratest 
scatter. Analysis of the reasons given for the 
orders of arrangement revealed three distinct 
criteria for measuring intratest scatter. The 
first was based on the number of incorrect 
items before the last correct item is reached. 
Three of the raters followed this rationale and 
felt that Scatter 1 showed more scatter than 
Scatter 2. They reasoned that one may con- 
sider scatter in terms of probability, and that 
one would much more expect Scatter 2 to occur 
than Scatter 1. 


The second criterion was based on the con- 
cept of number of changes, i.e., the more 
changes from correct to incorrect, and vice 
versa, the greater the amount of scatter. There- 
fore, using this standard, the other three raters 
felt that Scatter 2 showed more scatter than 
Scatter 1. 

The third criterion was concerned with the 
location of change. Considering Scatter 3 and 
Scatter 4 where the number right, the number 
of changes, and the last items attempted are 
the same in both cases, the three raters who 
thought Scatter 1 scattered more than Scatter 
2 extended their rule of probability and felt 
that Scatter 3 scattered more than Scatter 4. 
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The other three maintained that Scatter 4 
showed the greater scatter because there was a 
greater distance between the two changes in 
Scatter 4, that is, the change was “maintained” 
for a longer interval. 

The first criterion was selected because it 
seemed the simplest to express quantitatively. 
In constructing the formula, the assumption 
was made that the subtest items of the Wechs- 
ler-Bellevue Scale were arranged in order of 
increasing difficulty. That this assumption 
might not be valid did not seem of practical 
importance, for the scatter caused by any 
items not in absolute order of difficulty would 
be common to all testees and thus be a constant 
factor. 

After discarding many proposed formulae, 
one was found which combined as many of 
the properties of scatter (using the first cri- 
terion) as was possible. That formula was 


N—N 


c= 2 aol 
T 
where 


S =the scatter coefficient 
N =the last 


item for which any credit 
was given 
N’ =the total number of items for which 


any credit was given 
“=total raw score 


The numerator of this formula can be de- 
fined as the number of totally incorrect items 
within the range of items receiving any credit. 
The denominator served to differentiate scat- 
ters in those subtests where partial credit was 
given. For example, in the following two 
scatters, both would have the same value for 
the numerator (N — N’ = 4— 3= 1), but 
Scatter 6, having a smaller value for T, would 
have the larger scatter coefficient and thus 
would indicate greater scatter. 








Item Scatter 5 Scatter 6 

1 3 3 
2 3 1 
3 0 0 
4 3 3 
T=9 7 27 
N — N’ 4—3 1 

Scatter 5 — — —et  .ii 
= 9 9 
N—N’ 4—3 1 

Scatter 6 — = ——c= .14 
T 7 7 
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Fifteen scatter measures were utilized for 
the subtests of the Wechsler-Bellevue Scale. 
They were: 


1. Intratest scatter coefficient 
(test of 25 items). 

2. Intratest scatter coefhicient for Comprehension 
(test of 10 items). 

3. Intratest scatter 
(test of 12 items). 

4. Intratest scatter coefficient for accuracy on 
Arithmetic. Here time credit was eliminated, and 
the test consisted of ten items. For example: 


for Information 


coeficient for Similarities 





Item TimeCredit Accuracy Credit Total Credit 
1 0 1 1 
2 0 1 1 
3 0 1 1 
+ 0 1 1 
5 0 1 1 
6 0 1 1 
7 0 1 1 
8 0 0 0 
9 1 1 2 
10 0 1 1 
1 =? 10 
N—N’ 1i0—9 1 
$= = =—-=.11 
= 9 9 
The total raw score (T) for arithmetic is the 


accuracy credit. 

5. Intratest scatter coeficient for Digit Span 
Forward. This test consisted of 14 items, there be- 
ing two items for each of the seven digit span 
levels, and both trials on a digit span level were 
assumed correct if the first of each 
answered correctly. For example: 


series was 





Item Series Score 
1 582 + 
2 694 + (not given) 
3 6439 —_— 
4 7286 + 
5 42731 fio 
6 75836 _ 
N—N’ 4—3 1 
s= — = =-=.25 
T 4 4 


The raw score (T) earned for the above is 4 
(the highest level of success is four digits). 

6. Intratest scatter coefficients for Digit Span 
Backward. (Test of 12 items handled the same as 
Digit Span Forward). 

7. Intratest scatter coefficient for 
(test of 42 items). 

8. Intratest scatter coefficient for Picture Com- 
pletion (test of 15 items). 

9. Intratest scatter coefficient for accuracy on 
Picture Arrangement. Here the test consisted of 


Vocabulary 
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15 items which is the maximum credit for accu- 
racy on all of the six sets of pictures with time 
credit eliminated. For example: 


Time Accuracy Total 

Items Cards Credit Credit Credit 
., 3 House 0 2 2 
3, 4 Hold Up 0 2 2 
5, 6 Elevator 0 0 0 
7, 8 9 Flirt 0 2 2 
10,11,12 Taxi 1 2 3 
13,14,15 Fish 0 0 0 
1 T=<=8 9 

N = 11; the last item correct was 711 or the 

second of three credits given for accuracy on 

"2. 
N’ = 8; items (1, 2, 3, 4, 7, 8, 10, and 11) were 


answered correctly. 


T = 8; the number of credits received for ac- 
curacy alone. (For this scale, N’ will always have 
the same value as T). 

N—N 11—8 3 
s= = —-— .38 
= 8 8 
10. Intratest scatter coefficients for accuracy on 
Object Assembly. Here the tests consists of 18 
items, the maximum number of credits for accu- 
racy with time credit eliminated. For example: 





Time Accuracy Total 

Items Objects Credit Credit Credit 
1,2,3,4,5,6 Manikin 0 6 6 
7,8,9,10,11,12 Profile 1 5 6 
13,14,15,16,17,18 Hand 2 6 S 
8 T<=17 20 

N = 18; the last item correct was *18, or the 

sixth of six credits for “Hand.” 

N’ = 17; item #12 was answered incorrectly. 

T = 17; the number of credits received for ac- 


curacy alone. (As in Picture Arrangement, the 
value of N’ will always be the same as the value 
of T). 
N—N’ 18—17 1 
Be ayer. Re 
11. Intratest scatter coefficient for Block De- 
sign-Accuracy. Here the test consisted of 21 items, 
with three items for each card and time credit 
eliminated. For example: 


=— .06 





Time Accuracy Total 
Items Card Credit Credit Credit 
1,2,3 1 2 3 5 
4,5,6 2 1 3 4 
7,8,9 3 1 3 4 
10,11,12 4 1 3 4 
13,14,15 5 0 0 0 
16,17,18 6 1 3 4 
19,20,21 7 0 3 3 
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N = 21; the last item correct was #21, or the 
third of three credits given for card 7. 

N’ = 18; items 413, 414, and 415 (card 5) were 
incorrect. (Here again, as in Picture Arrangement, 
the value of N’ is always the same as the value 
for T). 

T = 18; the number of credits for accuracy 
alone. 

N—N’ 21—18_ 3 
¥ 18 18 
12. Intratest scatter coefficient for Block De- 

sign-Time. Here the test consists of 21 items, the 
maximum number of time credits possible, with 
credit for accuracy eliminated. This coefficient was 
computed in the same way as that for Block De- 
sign-Accuracy. For example, using the same data 
as for Block Design-Accuracy above: 


== a= 7 








N = 16; the last item correct was 716, or the 
first of three time credits given on card 6. 

N’ = 6; items 41, 22, 24, 27, 210, and 416 were 
correct. 

T = 6; the number of credits for time. (The 
value of N’ will always be the same as that for 
T). 

N—N’ 16—6 10 
sS= = —=—— 1.67 
x 6 6 


13. A-second intratest scatter coefficient for Block 
Design-Time (test of seven items). For example: 


Item Time Credit Accuracy Credit Total Credit 
1 2 3 5 
2 1 3 4 
3 1 3 4 
4 1 3 4 
5 0 0 0 
6 1 3 4 
7 0 3 3 
T=6 18 24 
N = 6; the last item for which any time credit 
was given was 76, 
N’ = 5; items #1, 22, 23,24, and 26 were all 


given at least partial credit. 
T = 6; the number of credits for time. 
N—N 6—5 1 
- —— Sao 7 
= 6 6 

14. Intratest scatter coefficient for Digit Symbol 
(test of 67 items). Here N’ was equal to the num- 
ber of items for which any credit was given (one- 
half credits were considered correct). 

15. A second intratest scatter coefficient for 
Digit Symbol (list of 67 items). Here N’ was 
equal to the number of items for which full credit 
was given (one-half credits were considered incor- 
rect). 





SUBJECTS 
Three psychiatric groups, i.e., neurotic, or- 
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ganic, and schizophrenic, were chosen as sub- 
jects for the study. Five limiting factors were 
placed on the original sample selected : 


1. Age: only individuals below the age of 50 
were selected in order to minimize the effects of 
senescence on the test results. 

2. Intelligence: all those whose optimal intelli- 
gence (premorbid level of intellectual functioning) 
was dull normal or higher were included, pro- 
vided a present functioning intelligence quotient of 
at least 65 was demonstrated. 

3. Weighted Scores: only those records where 
the weighted score of each subtest was equal to at 
least one were used. 

4. Education: six of elementary school 
was the minimum of education for any member of 
the sample. 

5. Representativeness: only those records which 
were considered representative, i.e., in which at- 
tention, concentration, and effort were satisfactory, 
were used. 


years 


Twelve neurotics and eight organics met all 
these criteria. Records of forty schizophrenic 
patients who met all of the five criteria were 
available, and it was thus possible, in the com- 
parison of the schizophrenics with each of the 
other two groups, to match members of the 
sample for age, optimal intelligence, and ap- 
proximately for education. In comparing the 
organic group with the neurotic group, match- 
ing was impossible because of the limited num- 
ber of cases. The neurotic group was therefore 
reduced to eight individuals, the four records 
eliminated serving to approximately equate the 
two groups for age and optimal intelligence. 


STATISTICAL PROCEDURE 


In comparing one group with another, means 
for the fifteen different intratest scatters were 
found for each group. The difference between 
the means for each intratest scatter was tested 
for significance. Because of the small number 
of subjects, Fisher’s small sampling technique 
was used. 

FINDINGS 


Schizophrenics and Neurotics. Twelve schiz- 
ophrenics matched with twelve neurotics for 
age, optimal intelligence, and education were 
compared for intratest scatter. The mean age 
for the schizophrenic group was 29.0 with a 
range of 21 to 47 and the neurotic mean age 
was 29.5 with a range of 19 to 44. The mean 
educational grade completed for both groups 
was 10.5. The range of completed education 
was 6th grade to completion of college for the 
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neurotics and 6th grade to completion of two 
years’ postgraduate study for the schizophre- 
nics. Optimal intellectual functioning for both 
groups ranged from dull normal to very super- 
ior with a median of high average. The diag- 
nostic composition of the two groups is given 


in Table 1. 


TABLE 1 
D1AGNostic COMPOSITION OF SCHIZOPHRENICS 
AND NEUROTICS 





Schizoph renics No. Neurotics No. 
Hebephrenic 4 Reactive Depression 4 
Simple 3 Anxiety 3 
Paranoid 2 Hysteria 2 
Catatonic 2 Psychasthenia 1 
Other Types 1 Neurasthenia 1 
=. P sai _ Mixed Type 1 
Total 7 12 Total 12 


Table 2 gives the means and standard devia- 
tions of each of the fifteen intratest scatters 
for schizophrenics and neurotics, and Fisher’s 
t’s obtained after comparing these two clinical 
groups. Using a ¢ of 2.074 as being significant 
at the five per cent level (Fisher’s tables), it 
is apparent that Comprehension, Picture Com- 
pletion, and Block Design-Accuracy scatters 
are significantly greater for schizophrenics than 
for neurotics. All other differences in intratest 
scatters on other subtests are not significant. 


TABLE 2 

SIGNIFICANCE OF DIFFERENCES IN INTRATEST 
SCATTER BETWEEN SCHIZOPHRENICS 

AND NEUROTICS 
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telligence. The mean age for the schizophrenic 
group was 37.5 as compared to 37.6 for the or- 
ganics with a range of 28 to 49 for both groups. 
The range in educational accomplishment was 
6th grade to completion of 3rd year high 
school for both groups, with the mean for the 
organics at 8.4 and for the schizophrenics at 
8.3. The optimal levels of intelligence ranged 
from dull normal to bright normal with a med- 
ian of average for both groups. Table 3 lists 
the diagnostic classifications of both groups. 


TABLE 3 
D1AGNOsTIC COMPOSITION OF SCHIZOPHRENICS 
AND ORGANICS 


Schizophrenics No. Organics" 

Simple 4 C.N.S. Syphilis 7 

Catatonic 1 Psychosis due 

Hebephrenic 1 to trauma 1 

Paranoid 1 

Other Types _—_i Pea iis olen 
Total 8 Total i 8 


The means, standard deviations, and Fisher’s 
t’s for this phase of the study are reported in 
Table 4. Using Fisher’s tables, a ¢ of 2.145 was 
considered significant at the five per cent level, 
and here Information, Comprehension, and 
Block Design-Accuracy show significantly 
greater scatter for schizophrenics. All other 
differences are not significant. 


TABLE 4 
SIGNIFICANCE OF DIFFERENCES IN INTRATEST 
SCATTER BETWEEN SCHIZOPHRENICS 
AND ORGANICS 








Schizo- Schizo- 
phrenics Neurotics phrenics Organics 
M SD M SD t M SD M SD t 

Information .40 -33 -41 17 -09 Information -46 -30 -21 13 2.18 
Comprehension .58 .59 16 19 2.22 Comprehension 50 .48 13 09 2.16 
Similarities 27 -83 -20 ms | -63 Similarities 85 -95 -22 37 1.62 
Arithmetic 19 -28 .16 -22 -28 Arithmetic 12 14 -ll 17 -12 
Digit Span Forward .09 14 07 18 @ .28 Digit Span Forward ll 13 .05 08 1.12 
Digit Span Backward -10 12 -08 10 =.84 Digit Span Backward -09 11 07 13 #8 .23 
Vocabulary -25 -ll -23 16 -35 Vocabulary 19 15 32 17 =«#1.51 
Picture Completion .59 42 27 26 2.11 Picture Completion &# 44 Al 85 1.38 
Picture Arrangement 22 39 15 14 BS Picture Arrangement 39 .49 35 64 18 
Object Assembly -08 13 -02 04 1.43 Object Assembly 15 -24 .03 05 1.28 
Block Design-Accuracy .20 .29 01 06 2.11 Block Design-Accuracy .36 .33 .00 00 2.77 
Block Design-Time-1 1.29 1.28 .88  — | Block Design-Time-1 1.05 1.91 46 61 .79 
Block Design-Time-2 25 36 10 at ia Block Design-Time-2 27 -66 07 10 =3=.78 
Digit Symbol-1 -00 00 .00 01 1.00 Digit Symbol-1 00 -00 .06 08 1.87 
Digit Symbol-2 04 .08 .03 06 .87 Digit Symbol-2 .05 07 16 13 1.93 





Schizophrenics and Organics. Eight schizo- 
phrenics were matched with an equal number 
of organics for age, education, and optimal in- 


Organics and Neurotics. Eight organics and 
eight neurotics were approximately equated 
as groups. The mean age for the organics was 
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37.6 with an age range from 28 to 49. The 
mean age for the neurotic group was 32.4, the 
range being from 25 to 44. The educational 
level for the organics ranged from 6th grade 
to 3rd year high school with a mean of 8.4. 
The neurotics ranged from 6th grade to first 
year college with a mean of 10.0. For both 
groups, the median optimal intelligence level 
was average with a range from dull normal to 
bright normal. The diagnostic breakdown for 
both groups is given in Table 5. 


TABLE 5 
DIAGNOSTIC COMPOSITION OF ORGANICS 
AND NEUROTICS 


Organics 


No. Neurotics No. 
CNS. Syphilis 7 Reactive Depression 4 
Psychosis with Anxiety 2 
Trauma 1 Hysteria 1 
Neurasthenia 1 
Total 8 Total 8 


Utilizing Fisher’s tables, a ¢ of 2.145 was 
considered significant at the five percent level. 
Differences in intratest scatter were found 
significant for Information and Digit Symbol- 
2 (Table 6). With Information, scatter was 
greater for neurotics, whereas with Digit Sym- 
bol-2, scatter was greater for organics. There 
were no other significant differences. 


TABLE 6 

SIGNIFICANCE OF DIFFERENCES IN INTRATEST 
SCATTER BETWEEN ORGANICS 

AND NEUROTICS 

















Organics Neurotics 

M SD M sD t 
Information 21 13 .46 16 3.21 
Comprehension 13 .09 22 20 1.08 
Similarities -22 37 -25 18 19 
Arithmetic ll «17 -22 -24 1.00 
Digit Span Forward 05 .08 .09 14 17 
Digit Span Backward .08 13 -10 ll -42 
Vocabulary 82 17 -25 16 -80 
Picture Completion 41 -35 31 -30 -59 
Picture Arrangement 85 64 .19 16 64 
Object Assembly .03 .05 .03 02 .00 
Block Design-Accuracy .00 .00 .00 00 .00 
Block Design-Time-1 46 .61 85 -90 -95 
Block Design-Time-2 07 -10 -10 -20 31 
Digit Symbol-1 -06 .08 -00 -01 1.74 
Digit Symbol-2 16 13 -038 03 2.70 





Composite Findings. Considering the eleven 
scales of the Wechsler-Bellevue, the following 
findings are noted: 

1. Information: Both the neurotic and 
schizophrenic groups scattered significantly 
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more than the organic group, but differences 
between neurotics and schizophrenics were not 
significant. 

2. Comprehension: The schizophrenics 
were found to have a significantly greater in- 
tratest scatter than neurotics and organics, 
whereas differences between the two latter 
groups were not reliable. 

3. Digit Span: No significant differences 
between the three groups were found. 

4. Arithmetic: There were no significant 
differences between the three groups. 

5. Similarities: No significant differences 
between the three groups were found. 

6. Vocabulary: There were no significant 
differences between the three groups. 

7. Picture Arrangement: No significant 
differences between the three groups were 
found. 

8. Picture Completion: The schizophrenic 
group scattered significantly more than the 
neurotic group but not more than the organic 
group. No real difference between neurotics 
and organics was found. 

9. Block Design: There was a significantly 
greater scatter shown on Block Design-Accura- 
cy by the schizophrenic group as compared to 
the neurotic and organic groups but there were 
no differences between the latter two groups. 
Block Design-Time failed to show any real dif- 
ferences between the three groups. 

10. Object Assembly: No significant dif- 
ferences between the three groups were found. 

11. Digit Symbol: The organic group 
showed significantly greater scatter than the 
neurotic group when test items which were 
given half credit were considered incorrect. No 
other significant differences were found. 


DISCUSSION 

Wechsler in his analysis of various psycho- 
pathological groups found that intertest scatter 
is small for organics, moderate for neurotics, 
and marked for schizophrenics [7]. It has gen- 
erally been assumed in clinical practice that 
intratest scatter paralleled intertest scatter as 
reported by Wechsler. The schizophrenic is con- 
sidered the most unpredictable testee in terms 
of his pattern of successes and failures. The 
neurotic reveals fluctuating performance as a 
consequence of anxiety, although not displaying 
intratest scatter to the same extent as the 
schizophrenic. The organic, with his intellectu- 








186 


al and associational poverty, shows lowered 
performance on intelligence tests, with greater 
consistency in intratest patterning. The find- 
ings of the present study on intratest scatter 
tend to confirm these clinical impressions par- 
ticularly in regard to schizophrenics (Table 
7). However, intratest scatter as measured in 
this study shows little significant difference be- 
tween neurotics and organics, except that the 
neurotics reveal greater scatter on the informa- 
tion subtest and the organics demonstrate 
greater scatter on digit symbol. 


TABLE 7 


STATISTICALLY SIGNIFICANT INTRATEST SCATTER BE- 
TWEEN SCHIZOPHRENICS, NEUROTICS, 
AND ORGANICS 


Group Comparisons Subtests 
Schizophrenic > Neurotic Comprehension, Picture 
Completion, Block Design 


Schizophrenic > Organic Information, Comprehension, 


Block Design 
Neurotic > Organic Information 
Organic Digit Symbol 


> Neurotic 


On six Wechsler-Bellevue subtests, no signi- 
ficant differences in intratest scatter were 
found between any of the three groups. This 
would suggest that intratest scatter resulting 
from psychopathology may not be significant 
in all areas of intellectual functioning measur- 
ed by the Wechsler-Bellevue. On the other 
five subtests (information, comprehension, pic- 
ture completion, block design, and digit sym- 
bol), significant differences were found. In no 
instance did neurotics and organics show sig- 
nificantly greater scatter than schizophrenics. 
The only discrepant finding in terms of the 
expected order of extent of intratest scatter 
is on the digit symbol test where the organic 
group showed greater intratest scatter than the 
neurotics. This discrepant finding, however, is 
an artifact produced by the method of scoring 
that was adopted for this study. It was found 
that when one-half credits on this subtest were 
considered correct, no significant difference 
in scatter was demonstrated between neurotics 
and organics. When, however, these one-half 
credits were considered incorrect responses, a 
significant difference was found. This tends to 
confirm the clinical finding that organics fre- 
quently perseverate with the half credit re- 
sponse. 
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A comparison of the present findings with 
those of Rapaport are of interest since his is the 
only known attempt to apply diagnostically 
intratest scatter on the Wechsler-Bellevue, al- 
though this is done on an essentially qualitat- 
ive level. Rapaport uses no organic population, 
so no comparison with that group is possible. 
However, he does use schizophrenics and neu- 
rotics although he has a much finer breakdown 
of psychiatric classifications than was possible 
in this study. 

Table 8 lists the seven subtests for which 
Rapaport discusses intratest scatter, which is 
defined by him in terms of failing easy items 
and passing hard items. On vocabulary, the 
present study finds no significant difference in 
intratest scatter between schizophrenics and 
neurotics, whereas Rapaport finds this scatter 
characteristic of schizophrenics and certain neu- 
rotics. Actually, the results are not very con- 
tradictory because Rapaport is merely stating 
that intratest scatter of schizophrenics and cer- 
tain neurotics is greater than that of his nor- 
mals, a fact that the present study is unable to 
verify because of the absence of a comparative 
normal population. Of interest, however, is the 
fact that his schizophrenic population and some 
neurotics show little difference between each 
other, and the present study finds no difference 
in intratest scatter between the schizophrenics 
and the total neurotic group. Comprehension 
shows clear agreement between the results of 
Rapaport and those of the present study. With 
information, a similar situation exists as with 
vocabulary. Rapaport finds greater intratest 
scatter for schizophrenics and some neurotic 
groups as compared to normals and the findings 
of the present study confirm the fact that there 
is an absence of significant difference between 
schizophrenics and neurotics. On similarities, a 
marked difference in findings is shown. Rapa- 
port finds that schizophrenic and neurotic de- 
pressives show greater scatter than other neu- 
rotics whereas the results of the present study 
do not demonstrate any significant difference be- 
tween these two groups. In arithmetic, the re- 
sults of both studies are again in disagreement. 
Rapaport finds a difference between schizo- 
phrenics and neurotics, a difference which is not 
revealed in the present study. On the picture 
arrangement test, Rapaport again finds schizo- 
phrenics and some neurotics showing greater 
intratest scatter than normals. The results of 
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the present study merely indicate that no statis- 
tically significant difference in intratest scatter 
was obtained between schizophrenics and neu- 
rotics. On the picture completion test, findings 
of the present study are in complete agreement 
with that of Rapaport. 


TABLE 8 
RAPAPORT’S FINDINGS ON INTRATEST SCATTER WITH 
SCHIZOPHRENICS AND NEeuROTICS COMPARED 
WITH FINDINGS OF PRESENT STUDY 





Subtest* 





Present Findings Rapaport’s 


Findings 


Information 





No significant 
difference 


Characteristic of 
schizophrenics and 
certain neurotics. 
Characteristic of 
schizophrenics. 
Characteristic of 
schizophrenics and 
severe neurotic 
depressives ; not 
characteristic of 
other neurotics. 
Characteristic of 
schizophrenics ; not 
characteristic of 
neurotics. 
Characteristic of 
schizophrenics and 
certain neurotics. 


Comprehension Schizophrenics > 
Neurotics 
Similarities No significant 
difference 


Arithmetic No significant 


difference 


Vocabulary No significant 


difference 


Picture 
Completion 


Schizophrenics > 
Neurotics 


Characteristic of 
schizophrenics ; not 
characteristic of 
neurotics. 


Picture No significant Characteristic of 
Arrangement _ difference schizophrenics and 
depressive 
neurotics. 





*These seven subtests are the only ones for which 
Rapaport presents findings on intratest scatter. 

Of the seven subtests wherein it has been 
possible to compare Rapaport’s study with the 
present study, comprehension and picture com- 
pletion reveal findings in complete agreement 
with those of Rapaport, and partial agreement 
is found with vocabulary, information, and 
picture arrangement. The discrepancies that do 
appear in the findings may well be the result 
of the fact that Rapaport has rearranged the 
items of subtests in many instances to conform 
to his found order of difficulty, has used a finer 
and more homogeneous classification of diag- 
nostic groups than was possible here, and has 
used a concept of scatter which is not identical 
with that developed in this study. 

The application of the measure devised for 
intratest scatter to more homogeneous psycho- 
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pathological groups may well sharpen the dif- 
ferences that were not apparent in the present 
study. No attempt was made to rearrange the 
items of the subtests to conform to a presumed 
order of difficulty since the arrangement of 
items offered by Wechsler is that which is used 
in current clinical practice. To accept Rapa- 
port’s findings of order of difficulty would im- 
ply acceptance of his control population as 
“normal,” an assumption the writers were not 
prepared to make. To change Wechsler’s order 
of difficulty would alter the concept of intra- 
test scatter as it is usually used by clinicians. 
It was felt that whatever discrepancies that 
were present in Wechsler’s order of difficulty 
would affect all groups equally and this factor 
would then be constant. However, with a 
more accurate scaling of items, the present 
technique devised for measuring intratest 
scatter might more sharply discriminate be- 
tween groups, and perhaps show significant 
differences where none was demonstrated. 

It should be emphasized that the absence of 
significant differences in intratest scatter on 
certain subtests between the three pathological 
groups used does not imply that these three 
groups do not scatter more than normals. Since 
a comparable normal population was not avail- 
able, it is not possible to designate what intra- 
test scatter for normals would be. It is there- 
fore feasible that comparison of these three 
groups with a normal population might demon- 
strate that, although the three groups do not 
show significant differences in intratest scatter 
among themselves on certain subtests, these 
three groups show significantly greater intra- 
test scatter than normals on these same subtests. 

Certain precautions in the interpretation of 
the results should be emphasized. Clearly, the 
size of the groups used were too small to per- 
mit indiscriminate application of these findings 
to clinical cases. The age range was rather 
large for the sample used, and it would have 
been desirable to limit the subjects used to more 
homogeneous diagnostic groups. Control data 
on the extent of intratest scatter within a nor- 
mal population are highly desirable and essen- 
tial. In spite of these limiting factors, however, 
the present study does offer a new approach to 
the measurement of intratest scatter and is de- 
serving of further study. 

It is not the authors’ purpose to offer an ob- 
jective measure of intratest scatter that will 
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ever eliminate the need for disciplined clinical 
judgment in evaluating intratest scatter. The 
measure that has been devised is merely conceiv- 
ed as an adjunct to present clinical practice 
and to the extent that this measure helps to 
sharpen clinical practice, to that extent will it 
serve as a contribution to the field of diagnos- 
tic testing. It may also be useful in research 
where there is need to measure the extent of, 
or change in, intratest scatter. 


SUMMARY 

An objective measure of intratest scatter on 
the Wechsler-Bellevue subtests was devised and 
applied to a group of neurotics, organics, and 
schizophrenics. Intratest scatter of schizo- 
phrenics was found to be significantly greater 
than that of neurotics on comprehension, pic- 
ture completion, and block design. Schizophre- 
nic scatter was also reliably greater than or- 
ganic scatter on information, comprehension, 
and block design. When neurotics were com- 
pared with organics, neurotics scattered signifi- 
cantly more than organics on information, and 
less than organics on digit symbol when one- 
half credits on this test were considered incor- 
rect. 
Received July 20, 1949. 
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PERSONALITY PROBLEMS OF THE SPINAL CORD INJURED" 


ALFRED D. MUELLER 


VETERANS ADMINISTRATION HOSPITAL, MEMPHIS, TENN. 


NUMBER of studies have been made in 

recent years concerning personality char- 
acteristics, emotional conflicts, feelings and at- 
titudes, and general adjustment problems of the 
spinal cord injured. Three such studies made 
during the past year are selected as the basis 
for appraising personality characteristics of 
paraplegic patients and for evaluating the psy- 
chological factors involved in the rehabilitation 
process. They were selected mainly because 
they utilize different measurement techniques 
in studying these patients. 

Mosak [2] studied a group of twenty-five 
paraplegic patients in a Midwest VA Para- 
plegic Center, and an equal number of non- 
paraplegic veterans as controls. He compared 
the two groups on the basis of test results of 
the Harrower-Erickson Group Rorschach and 
the Draw-a-Person test. 


McLaughlin [1] studied a group of twenty- 
five paraplegic patients in two Eastern VA 
Paraplegic Centers. He based his study on the 
test results of the Wechsler-Bellevue Scale and 
the Thematic Apperception Test. 

Mueller and Thompson [3] studied an un- 
selected group of 140 paraplegic patients in a 
Southern VA Paraplegic Center. They employ- 
ed tests of mental ability and of occupational 
interests as means of stimulating, directing, and 
motivating rehabilitation activities. 

Using the method of differences of means, 
Mosak found certain statistically significant 
differences in personality structure between his 
two groups. First, he found that the paraplegic 
group gave a larger number of bad responses 


1From Psychology Section, Veterans Administra- 
tion Medical Teaching Group, Kennedy Hospital, 
Memphis, Tennessee. Sponsored by the Veterans 
Administration and published with permission of 
Chief Medical Director, Department of Medicine 
and Surgery, Veterans Administration, who as- 
sumes no responsibility for the opinions expressed 
or the conclusions drawn by the author. 


on the Rorschach test. The M, CF, and F+ 
differences were significant at the 1 per cent 
level; C, Y, and F— differences were signifi- 
cant at the 2 per cent level; and Sum-C differ- 
ence was significant at the .1 per cent level. 
Third, differences in content were found as 
follows: Blood, Fire, Anatomy, and Card De- 
scription were significant at the 5 per cent lev- 
el; and Dirt and Human responses were signi- 
ficant at the 1 per cent level. 

According to Beck, human movement re- 
sponses (7) are representative of very deen 
wishes, innermost psychic activity, kinaesthetic- 
ally determined. Hence, it is hypothesized that 
the reduction in human movement responses in 
the paraplegic group is due to reduction in kin- 
aesthetic stimulation. The Draw-a-Person test 
gave some support to this hypothesis, since only 
six of the paraplegic group drew persons whose 
legs were functional, while twelve of the con- 
trol group drew figures in which movement 
seemed apparent. 

The reduction in human responses (H) 
shows that paraplegics tend to lack relation- 
ship with other people. There is poor human 
identification or empathy in a paraplegic group 
and less facility for forming associations with 
other people. This characteristic is also shown 
in the affective sphere. The paraplegic responds 
primarily to emotional elements of his environ- 
ment and is easily motivated by external stim- 
uli. His most prevalent emotional pattern is 
one of adolescent egocentricity, and there are 
many elements of infantile, uncontrolled affec- 
tive behavior. Manifestations of this uncon- 
trolled emotionality and potential explosiveness 
appear in the number of pure Color responses 
and of Fire responses. 

More shading, or Y responses, were elicited 
from the paraplegic than from the control 
group. The difference is significant at the 2 
per cent level. This indicates a tendency to 
anergic states and passivity as well as dys- 
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phoric or depressive moods. 

Specific personal needs are reflected in con- 
tent materials. The paraplegic records show 
many Anatomy, Blood, and Dirt responses. 
These “reflect their constant preoccupation 
with their injuries, with the incessant treat- 
ment, and with the ‘messiness’ of their condi- 
tions, especially with respect to their incontin- 
ence.” 

Mosak states that “the paraplegic may be 
described as one who is motivated with greater 
ease by stimuli external to himself and whose 
emotional makeup is primarily egocentric. 
However, the potentiality for mature emotion- 
al response as well as for violent explosiveness 
is still present. He is intensely preoccupied with 
his own condition and the ‘messiness’ of his 
life, is aware of the inutility of his legs con- 
stantly, but knowing that protest will not alter 
his condition, has become passive in his relation- 
ships. He is in many instances depressed and 
listless. The facility for association with other 


people has decreased, but he still observes group 
mores.” 


McLaughlin undertook to investigate emo- 
tional conflicts, adjustment difficulties, and 
personality characteristics and attitudes of 
paraplegic patients as revealed by twenty TAT 
pictures. The themas were analyzed for con- 
scious content on seven categories considered 
applicable to paraplegics, as follows: (1) at- 
titude toward health and strength; (2) social 
adjustments; (3) emotional maturity; (4) emo- 
tional tone; \(5) role in life; (6) attitude toward 
sexual adjustment; (7) attitude toward the 


future. His analysis indicates the following re- 
sults: 


Health and strength. A total of 134 stories, or 
27.4 per cent of the themes, expressed attitudes to- 
ward health and strength. Of this number, 70.1 
per cent expressed the attitude that health and 
strength are necessary for accomplishment, while 
28.4 per cent feel that they are being deprived of 
normal contacts because of their physical hand- 
icaps. 

Social adjustment. Of 365 stories, 73 per cent 
expressed attitudes toward social adjustment. Atti- 
tudes of bitterness and cynicism were most preva- 
lent (45 per cent). Apprehensiveness was ex- 
pressed in 38 per cent of the stories; and affec- 
tionate, congenial friendships were expressed in 
only 17.3 per cent. 

Emotional maturity. Two hundred seven of the 
themes were rated under this category. The tend- 
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ency to act impulsively was most prevalent (42.5 
per cent). Mature emotionality appeared in only 
13.5 per cent of the themes. 

Emotional tone. Four hundred thirty stories were 
rated under this category. Feelings of insecurity 
were revealed most frequently (32.2 per cent). 
Depression was evidenced in 112 of the themes and 
is one of the most outstanding features of the per- 
sonality structure. Happiness is shown in only 10 
per cent of the stories. 

Role in life. Of 430 stories rated in this cate- 
gory, ambivalence and indecision of action were 
the most prominent manifestations (41.9 per cent). 
Submission and dependence were the next most 
prominent expressions (36.3 per cent). 

Attitude toward sexual adjustment. Only 92, or 
18.4 per cent, of the stories deal with this cate- 
gory. Of this number, 59.8 per cent indicated mal- 
adjustment in the psychosexual sphere. Ambiv- 
alence was expressed in 12 per cent of the stories, 
and good adjustment in 28.2 per cent. 

Attitude toward future. Expressions toward the 
future were expressed in 442 stories, or 88.4 per 
cent of the total number. The most outstanding 
characteristic was an attitude of depression (43.4 
per cent). A type of autistic thinking or unrealis- 
tic success was the next most prevalent attitude to- 
ward the future (25.8 per cent). Fear of the fu- 
ture was expressed in 16.1 per cent, while hopeful 
accomplishment was expressed in only 14.7 per 
cent. 


Summarizing the personality characteristics 
and attitudes of the paraplegic patients in this 
study, we find: (1) Physical strength and 
health are considered necessary for confidence 
and accomplishment, and that the patients gen- 
erally have not accepted their injury and its 
limitations. Conflict between desires and in- 
ability to perform permeates the personality and 
produces a feeling of dependency. (2) There is 
ambivalence concerning role in life, producing 
indecisiveness and feelings of insecurity, which 
are reflected in social adjustment and relative 
lack of emotional control. Feelings of apprehen- 
sion and cynicism are manifested in the sphere 
of social adjustment. (3) There is a relative 
lack of conscious expression concerning sexual 
adjustment. However, analysis of obtained data 
reveals maladjustment in the psychosexual 
sphere. (4) Depression is the most prevalent 
personality characteristic. It is revealed in the 
general tone of expression as well as in the out- 
look toward the future. Unrealistic success, a 
form of autistic thinking, also characterizes the 
outlook toward the future. 

These basic psychological factors should be 
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considered in the rehabilitation of paraplegic 
patients. Mueller and Thompson studied an 
unselected group of 140 paraplegic patients, 19 
of whom were partial quadriplegics, in order 
to appraise the effectiveness of psychological 
evaluation as a means of motivating rehabilita- 
tion activities. Each patient was given tests of 
intelligence and vocational interests. In addi- 
tion, he had numerous guidance interviews 
with physicians, psychologists, vocational guid- 
ance counselors, social workers, and rehabilita- 
tion workers. ‘The importance of the role that 
psychological factors play in rehabilitation was 
recognized. It should be emphasized that a tre- 
mendous psychological load is placed upon the 
personality as it must absorb the traumatic ef- 
fects of injury, shock, extended periods of ill- 
ness, severe physical handicaps involving loss 
of nerve and muscle control, impaired bowel 
and bladder fuctioning, altered sex function- 
ing, and the multitude of adjustments that 
must be made in facing everyday problems of 
life. Considerable situational depression occurs, 
especially in the early stage of paraplegia. 
There are feelings of hopelessness, resentment, 
self-pity, dependence, and other abnormal atti- 
tudes that require psychological treatment in 
order that the patient may face his problems 
realistically and will be willing to take an ac- 
tive part in rehabilitating himself. 


The studies here reported agree that de- 
pressive reaction is characteristic of many para- 
plegic patients, and that they have a tendency 
to react with less mature emotional expression, 
to show greater dependency needs, and to be 
unrealistic in their thinking. The suggested 
solutions for overcoming these emotional handi- 
caps are largely psychological in nature and in- 
volve helping the patient accept reality, estab- 
lish independence and self-sufficiency, and take 
an active part in his own rehabilitation. 


This was essentially the procedure employed 
by Mueller and Thompson. By means of psy- 
chological evaluation, motivation, stimulation, 
and morale building, the patient was helped to 
overcome his frustrations and to establish 
healthy emotional and attitudinal adjustments 
to his disability. The program involved gain- 
ing the patient’s confidence and cooperation, 
overcoming wishful thinking regarding recov- 
ery, occupational plans, and home and commun- 
ity living, and to view them realistically and 
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purposefully, to the end that he would partici- 
pate willingly in his rehabilitation program. 
The responsibility for motivation was a cooper- 
ative undertaking shared by the entire staff. A 
certain amount of initial resistance was found 
in many patients to rehabilitation activities re- 
quiring physical and mental effort on their part. 
Characteristic reactions were: indifferent, am- 
bivalent, or depressive attitudes toward the 
future and the possibilities it could hold for 
them socially and vocationally; indecisiveness 
and feelings of insecurity; and assuming a sub- 
missive and dependent role in life, with accom- 
panying immature emotional behavior. 

In order to change these unhealthy mental 
states and resolve emotional conflicts, a con- 
certed program of motivation was undertaken 
aimed at mobilizing physical, mental, and per- 
sonality resources in the patient and directing 
them toward his rehabilitation. To achieve 
maximum rehabilitation of patients with spin- 
al cord injuries, we must consider not only res- 
toration of maximal physical and emotional 
fitness, but also training for vocational and oc- 
cupational activities commensurate with men- 
tal abilities, vocational interests and aptitudes, 
and personality characteristics. 

The rehabilitation program was comprehen- 
sive in order that the needs, interests and capac- 
ities of individual patients could be satisfied. 
Throughout the rehabilitation process, stress 
was placed on individual therapy. Activities 
were not engaged in for the sake of activity, 
per se, but were evaluated in terms of their 
therapeutic significance. There was an inter- 
play of both physical and psychological medi- 
cine, stressing treatment of the patient as a 
whole, physically and emotionally. As the pa- 
tient regains physical, social, and vocational in- 
dependence through this therapeutic process, he 
gradually regains confidence, overcomes fear, 
loses the attitude of hopelessness and defeatism, 
and reflects a fundamental change in attitude 
and outlook. 


The experimental study shows that the pro- 
gram is producing significant results in terms 
of personal, social, and vocational adjustment. 
One hundred thirteen, or 80 per cent, of the 
patients included in the study are actively pur- 
suing their expressed vocational interests. Fifty 
have been discharged from the hospital during 
the one-year period of the study. 
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The psychological thread that runs through 
all rehabilitation activities is that of therapy 
through interest and motivation. Without these, 
rehabilitation activities are empty shells and the 
process is doomed to failure. Recognizing their 
importance, special efforts were made to dis- 
cover the capacities, interests, and personality 
characteristics of each patient and to use them 
as foundations upon which to build a sound re- 
habilitation program. Psychological tests were 
used both for diagnostic and motivational pur- 
poses. Information revealed by the tests was in- 
valuable in assisting physicians and rehabilita- 
tion staff workers in evaluating and under- 
standing patients, as well as in helping patients 
to know and understand themselves better. 
With this knowledge, the way is opened for a 
sound cooperative rehabilitation program in 
which the patient’s personality becomes the 
center of the therapeutic process, and the ac- 
tivity becomes the means for carrying on thera- 
py. The program is eminently successful as 
judged by an estimate of upwards of 70 per 
cent who achieve good psychological adjust- 
ment, and the high percentage of patients—ap- 
proximately 80 per cent—who have been dis- 
charged from the hospital and who are success- 
fully adjusting to normal community life at 
home. 


SUMMARY AND CONCLUSION 


Personality problems of the spinal cord in- 
jured, as revealed by recent psychological stud- 
ies, are summarized, and their bearing on the 
rehabilitation of these patients is outlined. Per- 
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sonality characteristics and attitudes revealed 
most frequently are: situational depression as- 
sociated with the disability and with future 
role in life; a tendency to immature emotional 
expression characterized by impulsiveness, ex- 
plosiveness, and egocentric behavior ; a tenden- 
cy toward ambivalence and indecision charac- 
terized by passive, submissive, and dependent 
behavior; and, a tendency toward autistic or 
unrealistic thinking regarding problems of life. 

The suggested solutions for overcoming these 
emotional handicaps are largely psycholgi- 
cal in nature, and involve helping the patient, 
through various therapies, to accept himself 
and his problems on a mature emotional basis, 
assume a realistic attitude toward his future 
activities, establish independence and self-suffi- 
ciency, and take an active part in his own re- 
habilitation. 


A successful program of rehabilitation, based 
on a psychological approach, is discussed. 


Received July 25, 1949. 
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THE MEASUREMENT OF INTELLIGENCE OF ONE 
HUNDRED TWO MALE PARAPLEGICS' 


MORSE P. MANSON 
BIRMINGHAM VETERANS ADMINISTRATION HOSPITAL 
VAN NUYS, CALIFORNIA 


PPROXIMATELY one thousand 

healthy young adults each year in the 
United States are made helpless by severe spin- 
al cord injuries [5]. Therefore, the rehabilita- 
tion of the paraplegic may be considered to be a 
permanent problem. 

Psychologically, the paraplegic absorbs the 
trauma of pain, shock, infection, extensive and 
intensive hospitalization, loss of movement, re- 
duced — if not totally lost — sexual function, 
impaired bladder and bowel control and feel- 
ings of depression and dependence. With a 
single blow to the spinal cord an individual, 
perhaps at the peak of his physical development, 
is reduced from the independence of an adult to 
the dependence of a neonate. No personality, 
no matter how strong and stable, can face such 
a devastating onslaught without severe psychic 
repercussions [2, 3]. 

Because of the severity of the physical and 
psychological injuries to the paraplegic, many 
rehabilitation workers have felt that there 
might be serious intellectual impairment. Psy- 
chiatrists often state that there is a “breaking 
point” for each person, implying that to go be- 
yond this point would result in a breakdown of 
emotional and mental functions. Do the trau- 
ma of paraplegia take the paraplegic beyond 
the “breaking point”? Are there any changes 
in intelligence attributable to the injuries ac- 
companying paraplegia? An analysis of intelli- 
gence test scores of paraplegics may answer 
these questions. 

No studies dealing with the measurement of 
intelligence of paraplegics have been seen in the 
literature. One paper [7] reported the use of 

1§ponsored by the Veterans Administration and 
published with the approval of the Chief Medical 
Director. The statements and conclusions published 
by the author are a result of his own study and do 


not necessarily reflect the opinion or policy of the 
Veterans Administration. 
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projective techniques on paraplegics but found 
such techniques to have less value than the in- 
terview ; besides, paraplegics showed no disposi- 
tion to participate in any tests which did not 
seem to have immediate value to them. Two 
studies mention the results of mental ability 
tests taken prior to the injury. Thorn et al. [7] 
noted that general mental ability ranged from 
low scores in Grade IV to high scores in 
Grade I on the Army General Classification 
test. On the same test, Ginzberg and Klarman 
[1] found paraplegics to be about equally di- 
vided among “low,” “medium,” and “high” 
scores. It thus seems that there might have 
been a normal distribution of scores on the 
AGCT prior to paraplegia. 


METHOD 


Practically all the paraplegics in Birming- 
ham Veterans Administration Hospital had 
been interviewed concerning their work plans 
upon discharge after reaching maximum hos- 
pital benefits. The various resources of the hos- 
pital rehabilitation program were described 
and the paraplegics were urged to use these 
services. Vocational advisement was but one of 
many services. Some paraplegics did not feel 
the need for advisement and refused to partici- 
pate, stating that work of any kind would be 
impossible for them. These had decided to live 
upon their pensions. Others had plans which 
they hoped to follow without assistance from 
any one; still others had plans but felt, for 
varied reasons, not yet ready to take positive 
action. However, a large percentage were in- 
terested in vocational advisement and planning. 
Many of these were given intelligence tests as 
well as other tests. These intelligence tests sup- 
plied the data in this study. 


It has been observed [1] that “those who 


had the greatest schooling were most desirous 
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of further schooling, the most favorable dis- 
position being found at the college level, where 
four out of five wanted to continue their edu- 
cation. . .about 70 per cent of those who did 
not desire further schooling were also disinter- 
ested in occupational training.’’ It seems reason- 
able to assume that the group using advisement 
services was better motivated, had more edu- 
cation, and was more intelligent than the 
group not using advisement services. 


It is thus probable that the group of para- 
plegics availing itself of vocational advisement 
services was a select one and not representative 
of the total paraplegic population in Veterans 
Administration hospitals. 

The intelligence scores of 102 paraplegic 
veterans, all male, were obtained and analyzed. 
Within this group, 84 paraplegics had com- 
pleted vocational advisement and developed 
training programs leading to specific employ- 
ment objectives. 18 paraplegics, at the time of 
this study, had not completed advisement. 
These two subgroups were compared. 


GROUPS 


The age range for the total group was from 
18 to 38 years, with a mean age of 25.3 years. 
The subgroup completing advisement was 
slightly younger than the subgroup not com- 
pleting advisement. The educational range of 
years of education completed was from 6 to 15 
years for the completed subgroup and 3 to 15 
years for the incompleted subgroup; the means 
of years of education completed were 11.1 and 
10.2 years respectively. A difference signifi- 
cant at the five per cent level of confidence 
was found for years of education completed. 

In the subgroup completing advisement, 55 
per cent had been privates or privates first 
class, 28 per cent noncommissioned officers and 
17 per cent commissioned officers. The percent- 
age of commissioned officers is much larger than 
would be found in a random sample, once again 
indicating that this group of paraplegics is a se- 
lect one. Eighty-four per cent of completed 
paraplegics had been in the Army, 8 per cent 
in the Navy and 8 per cent in the Marine 
Corps. Of the subgroup completing advise- 
ment 86 per cent were service connected dis- 
ability cases compared with 76 per cent of the 
subgroup not completing advisement; thus 12 
per cent and 24 per cent of the respective 
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subgroups were non-service-connected para- 
plegics, being accounted for by civilian-type 
accidents. Approximately 90 per cent of the 
total group were white veterans and 10 per 
cent nonwhite veterans. 


RESULTS 


Three different intelligence tests were used. 
They were: (a) California Test of Mental 
Maturity, Advanced ’47 S-Form [6], (b) Otis 
Quick-Scoring Mental Ability Test, Gamma, 
Form Am [4], and (c) Wechsler-Bellevue 
Scale, Form I, [8]. 

On each of the three tests, as well as on a 
composite mean of the three tests, the mean in- 
telligence quotient for the group of 102 para- 
plegics was above 100. On two mean scores, 
for the nonlanguage factors of the California 
for the incompleted subgroup, and the Otis 
for the incompleted subgroup, did the mean IQ 
fall below 100. 

The mean IQ for the total group on the 
California was 108.9, placing at the 70th per- 
centile. The completed subgroup made a higher 
mean IQ than the incompleted subgroup, but 
the critical ratio of 1.4, although indicating a 
trend is not statistically significant. A mean IQ 
of 114.1 on the language subtests was found 
for the total group. The difference between 
the subgroups on language IQ’s was insignifi- 
cant. The mean IQ on nonlanguage subtests 
for the total group was 106.1. A highly signifi- 
cant difference was found between the com- 
pleted and incompleted subgroups, indicated 
by a critical ratio of 4.3, on nonlanguage test 
performance. 

If it is assumed that the language tests on the 
California measure relatively stable mental 
functions while the nonlanguage tests are more 
prone to change with mental deterioration, a 
measure of deterioration may be constructed. 
This crude measure might be indicated by di- 
viding the nonlanguage IQ by the language 
IQ. If the nonlanguage IQ is the same or 
higher than the language IQ it would be as- 
sumed that no mental deterioration had oc- 
curred. 

Using the above formula, 4.8 per cent deter- 
ioration was found for the paraplegics com- 
pleting advisement ; 26.2 per cent for those not 
completing advisement; and 7.1 per cent for 
the total group taking the California test. A 
percentage higher than 10 per cent might be 
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TABLE 1 





elie Range of 
Groups IQ’s 
Completed Advisement.......... ‘ 57 64-156 
Incompleted Advisement......... 11 69-141 
| IRR IE it 68 64-156 
Completed Advisement........... 57 66-138 
Incompleted Advisement......... 11 80-130 
I incre cnnntontenpeciincs 68 66-138 
Completed Advisement... 57 75-141 
Incompleted Advisement.. 11 75-125 
ae 68 75-125 


viewed with suspicion of mental deterioration 
(Table 1). 

The mean IQ for the group of 23 parapleg- 
ics taking the Otis intelligence test was 106.2. 
There was no significant difference between 
the subgroups, although the subgroup complet- 
ing advisement made the higher mean IQ 


(Table 2). 
TABLE 2 


Otis Quicx-Scortnc Test oF MgeNTAL MATuRITY 








Groups : No. Range of Mean 














CALIFORNIA TEST OF MENTAL MATURITY 


Mean 


SD of Critical Deterioration 

IQ 1Q’s Ratio % 

Language subtests 
114.0 20.0 4.8 
114.5 23.0 ol 26.2 
114.1 20.8 7.1 
Non-language subtests 
108.5 9.8 

$4.5 16.8 4.3 
106.1 10.6 

All subtests 

112.9 14.0 
105.3 15.5 1.4 
108.9 15.2 


Eleven paraplegics were administered the 
Wechsler-Bellevue test of intelligence. The 
mean IQ on the total test was 112.7; the com- 
pleted group, once again, made a higher mean 
IQ than the incompleted group, but the differ- 
ence was not statistically significant. No signi- 
ficant differences were found when the two 
subgroups were compared on the verbal or the 
performance subtests, although the completed 
subgroup made higher mean I1Q’s on the verbal 
and performance tests. 





1Q’S 1Q ~ has — Since the individual subtest scores were not 
— = - available, making it impossible to determine 
Advisement 18 80-129 1084 14.2 the extent of mental deterioration as defined 
Incompleted by Wechsler [9], the same crude determina- 
Advisement 5 67-123 89.5 135 1.38 : . : ” 
tion of deterioration—as made in the Califor- 
Both gree = 3 1D 8S _ mia test—was made, and no deterioration was 
TABLE 3 
WECHSLER-BELLEVUE TEST OF INTELLIGENCE 
’ : res No. Range "y ; ~ Mean . SD of Critical Deterioration 
Groups IQ’s IQ 1Q’s Ratio % 
Verbal subtests 
Completed Adv....................... ‘ 9 86-140 109.0 18.0 0 
Ea ae 89-113 101.0 16.9 4 0 
TD oi cinkddhapiacsivesesiacalibilt 11 86-140 107.2 18.1 0 
Performance subtests 
Completed Adv.......................... 9 104-129 117.0 10.3 
See en 2 93-112 102.5 13.4 1.1 
0 ee anne 11 93-129 113.6 11.7 
Total subtests 
Completed............... iinciiahnemeia 9 95-136 114.5 14.1 
I nisciticnpiccioteanatemnentin 2 93-114 103.5 14.9 7 
Both Groups....................--. 11 93-136 112.7 14.0 
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noted. However, on two of the paraplegics 
completing advisement, the Wechsler subtest 
scores were available. One paraplegic had 8 
per cent and the other had 10 per cent deterior- 
ation. The scant evidence of this study would 
seem to indicate that the paraplegics tested as 
a group did not reveal mental deterioration 
to any marked extent, if at all (Table 3). 


When the results of the three intelligence 
tests were combined, a total mean IQ for the 
group of 102 paraplegics was 108.7. The 84 
paraplegics completing advisement were higher 
in mean IQ than the 18 paraplegics not com- 
pleting advisement, but the difference was not 
significant (Table 4). 














TABLE 4 
COMBINED INTELLIGENCE Test RESULTS 
Groups No. Range of Mean SD of Critical 
IQ’S IQ IQ@s_ Ratio 
Completed Adv. 84 75-141 109.7 14.9 
Incompleted 
Adv. 18 67-125 104.0 17.3 6 
Both groups 102 67-141 108.7 15.5 
DISCUSSION 


Because paraplegics receive severe physiologi- 
cal and psychological damage the question fre- 
quently is raised by rehabilitation workers if 
intellectual impairment has occurred. There is 
some slight evidence in previous studies that 
paraplegics were normally distributed in in- 
telligence prior to their injuries. In the group 
of 102 paraplegics studied the mean IQ’s on 
three different intelligence tests indicated that 
this group was of high average intelligence. 
Because this group appeared to be highly moti- 
vated, had fairly high average educational 
achievements, and had a fairly large percentage 
of commissioned and noncommissioned officers 
it is believed that the group was a select one 
and not representative of the paraplegic popu- 
lation in veteran’s hospitals. It is very likely 
that groups of paraplegics not interested in vo- 
cational planning may have lower mean IQ’s. 
However, the impression of the writer is that 
representative groups of paraplegics would not 
fall below the mean intelligence of the general 
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population, but probably be slightly higher 
since selection factors had operated in the se- 
lection of military personnel. 

There was no conclusive evidence that men- 
tal deterioration had occurred in the group 
studied. Only crude attempts had been made 
to determine this. Nevertheless the opinion, 
both from the available test results and from 
personal observations, is that paraplegia per se 
is not a precursor of mental impairment. 


CONCLUSIONS 


1. A group of 102 male paraplegic veter- 
ans had a mean IQ of 108.7. Paraplegics com- 
pleting vocational advisement and developing 
training programs leading to specific work ob- 
jectives had higher mean IQ’s than paraplegics 
not completing advisement. But the difference 
in mean IQ’s was not statistically significant. 

2. No substantial evidence of mental de- 
terioration due to injuries pertaining to para- 
plegia was found. 


Received August 8, 1949. 
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THE EFFECT OF FOCAL BRAIN LESIONS ON RECALL 
AND ON THE PRODUCTION OF ROTATIONS IN 
THE BENDER GESTALT TEST" 

LEO J. HANVIK anv A. LLOYD ANDERSEN 


MINNEAPOLIS VETERANS ADMINISTRATION HOSPITAL 


HIS report is concerned with the Bender 

Gestalt protocols of a group of brain- 
damaged patients in each of whom the locus 
of the cerebral lesion had been made known by 
surgery or through X-ray verification. The 
total group of 44 patients was made up of 20 
patients with lesions in the “dominant” hemis- 
phere (presumed to be the left hemisphere in 
right-handed persons) and 24 patients with 
lesions in the other, or “nondominant,” hemis- 
phere. All were male patients in the Minneap- 
olis Veterans Administration Hospital and all 
had been subjected to fairly exhaustive psy- 
chological study which included, among other 
things, the administration of the Bender Ges- 
talt test. 

PROCEDURE 


The Bender test was administered to all pa- 
tients in the standard manner [1]. In addition 
to having the patients copy the nine drawings 
from the cards, the examiners also tested the 
ability of each patient to draw the gestalten 
from memory, i.e., without looking at the mod- 
el drawings. The protocols were then examined 
with a view to noting what differences, if any, 
obtained between the drawings of the group of 
patients with dominant-hemisphere lesions and 
those of the group with nondominant-hemi- 
spherel lesions. 


In order to compare the two groups, two 
measures capable of quantification were obtain- 
ed: (1) the mean number of figures recalled 
correctly by each patient subgroup, and (2) 


1Published with the permission of the Chief 
Medical Director, Department of Medicine and 
Surgery, Veterans Administration, who assumes no 
responsibility for the opinions expressed nor con- 
clusions drawn by the authors. Gratitude is ex- 
pressed for the aid and cooperation of the Neu- 
rology staff at the Minneapolis Veterans Adminis- 
tration Hospital. 


197 


the per cent of cases in which at least one 
drawing had been distorted by rotating it 30 
degrees or more from the original plane. These 
two variables were chosen, not only because a 
quantitative score could be obtained on them, 
but also because extensive clinical use of the 
Bender test had suggested that the tendency to 
rotate drawings together with a tendency to- 
ward poor recall of the figures might be par- 
ticularly characteristic of brain-damaged pa- 
tients, and that patients with brain lesions in 
different locations might differ with respect to 
these variables. 


To extend the study a little further, a con- 
trol group of other hospital patients was used. 
This control group consisted of patients whose 
presenting complaint on admission to the hos- 
pital was low back pain. These patients had 
been studied as a part of a different investiga- 
tion [2] and had been found finally to be 
made up actually of two clinical types: (a) 
fourteen patients with herniated intervertebral 
disc, (ultimately surgically verified) and (b) 
twenty-three patients whose general physical 
and neurological examinations had been es- 
sentially negative and in whom the low back 
syndrome was deemed to be psychogenic in ori- 
gin. 


As was done in comparing the two sub- 
groups of brain-damaged patients, the two 
subgroups of controls were compared with 
each other with respect to per cent of cases ro- 
tating and with respect to recall. The control 
subgroups did not differ significantly from one 
another on these two variables. In recall score, 
the total control group was approximately the 
same as a large sample of normals studied by 
Welsh (Unpublished research). Data were not 
available on rotations for a normal group. Al- 
so, comparisons on the two variables were made 
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between the nondominant-hemisphere group 
and the control group as a whole, and between 
the dominant-hemisphere group and the con- 
trol group. The results of the latter compari- 
sons together with the significance of the find- 
ings, are shown in Tables 1 and 2. 


TABLE 1 
PERCENTAGE OF CAsES PropUCING ONE oR More 
ROTATIONS IN THE BENDER-GESTALT DRAW- 
INGS: RESULTS OF COMPARISONS 
% 


Brain % 


Dam- Con- Differ- 

aged trols erence CR P 
Dominant- 
Hemisphere Damage: 55.00 18.91 36.09 2.81 <.010 
Nondominant- 
Hemisphere Damage: 62.50 18.91 43.59 3.70 <.001 
Total 4.09 


59.09 18.91 40.18 


<.001 








TABLE 2 
MEAN NuMBER OF BENDER GESTALT FIGURES 
REPRODUCED ON RECALL 











Brain- 

Damaged Control 

Patients Group Differ- 

Mean SD Mean SD ence CR P 
Dominant- 
Hemisphere 
Damage: 3.90 1.97 5.57 1.73 1.67 .64 >.05 
Nondominant- 
Hemisphere 
Damage: 4.67 1.58 6.57 1.73 90 .388 >.05 
Total 4.32 1.81 5.57 1.73 1.25 .50 >.05 





RESULTS 


1. The dominant-hemisphere group is statis- 
tically not significantly different from the non- 
dominant-hemisphere group with respect to 
either the number of figures reproduced on the 
recall exercise or percentage of cases rotating 
the figures. 

2. There are no significant differences be- 
tween either of the brain-damaged subgroups 
and the control group as to number of figures 
recalled. 

3. There are significant differences (P. is 
less than .01) between the brain-damaged 
group and the control group on percentage of 
cases rotating the figures. The brain damaged 
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group has significantly more instances in which 
at least one figure is rotated in the protocol. 


Other facts of interest which came to light 
in these comparisons (and which are trends 
rather than statistically significant differences) , 
are these: 


A. Rotations occur in both “normals” (e.g. 
the patients with herniated disc) and in “neu- 
rotics” (e.g. the cases of probable functional 
low back pain), as well as in patients with 
brain damage. 


B. There is a slight trend toward poorer re- 
call scores among patients with dominant- 
hemisphere lesions, in comparison with patients 
with nondominant-hemisphere lesions. 


C. There is a similar slight trend among the 
nondominant-hemisphere-damaged patients to 
be more prone to rotate the figures than is the 
case among dominant-hemisphere-damaged pa- 
tients. 

SUMMARY 


Statistics are presented relative to the Bend- 
er recall scores, and the percentage of cases ro- 
tating the Bender figures, when the Bender 
protocols of a sample of brain-damaged patients 
are studied. Comparisons are made between 
patients with lesions in the dominant hemis- 
phere and those with lesions in the nondomin- 
ant hemisphere. Comparisons are also made be- 
tween the recall scores and rotative tendencies 
of the brain-damaged patients and measures of 
the same variables for a control group. The 
only difference appearing which is statistically 
significant is the difference between the brain- 
damaged group and control group on percent- 
age of cases rotating the Bender figures. The 
brain-damaged group exceeds the control group 
significantly in this characteristic. 


Received July 18, 1949. 
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A BASIS FOR INTEGRATION RATHER THAN 
FRAGMENTATION IN PSYCHOTHERAPY 


REX MADISON COLLIER 


UNIVERSITY OF ILLINOIS 


‘INCE the appearance of Rogers’ Counsel- 
ing and psychotherapy [9] in 1942, we 
have observed a very rapid increase in the fre- 
quency with which psychologists have given 
attention to both the theoretical and the prac- 
tical problems of psychotherapy. The publica- 
tion of Counseling and psychotherapy was, of 
course, coincidental with the increase in de- 
mand for psychotherapy from clinical psycholo- 
gists in the military assignments. ‘The book not 
only gave the clinical counselor or psycho- 
therapist a technique and a code, but also pre- 
sented the first verbatim transcript of a series 
of recorded therapy interviews. To many psy- 
chologists, it has seemed that, at last, we have 
stated for ourselves a point of view and a 
technique that is not only distinctly psycholog- 
ical but is also completely legitimate for use by 
the psychologist. The next tempting step was 
easily taken, namely, to emphasize contrasts be- 
tween “traditional” psychotherapies and the 
“new” psychotherapy. 

While the purpose of this paper is not to 
evaluate the significance of Rogers’ “nondirec- 
tive” or “client-centered” emphases, reference 
must be made to its influence in current psy- 
chotherapy practice. The purpose of the paper 
is rather to indicate the fragmenting tenden- 
cies in the field of psychotherapy and to sug- 
gest an approach which may aid in constructive 
integration. Divergent emphases often occur in 
a field of science prior to greater unification. 
The history of psychology could illustrate a 
number of these sequences of (a) fragmenta- 
tion, (b) a degree of resolution of the contro- 
versies, and (c) new trends. Twenty-five 
years ago there appeared The psychologies of 
1925 [7] and subsequently The psychologies 
of 1930 [8]. The divergence of theory con- 
sumed considerable time and attention then. 
However, while there are still differences of 
emphasis, it would be far less appropriate cur- 


rently to extend this series. The situation in 
psychotherapy may very well run a kind of 
parallel. It would be possible and even desir- 
able to publish a Psychotherapies of 1950; 
there might well be another such volume in 
1955, but eventually integrating approaches 
should occur which would reduce current con- 
fusion and increase certainty. 

The fact that all therapies have successes and 
all therapies have failures strongly suggests 
that there should be bases of integration. No 
therapy technique can be credited with either 
all the “cures” or all the failures. That some 
therapists function more comfortably and 
effectively with one emphasis rather than an- 
that personality differences 
among therapists may be as important as dif- 
ferences in technique. Clues for integrating 
concepts are not likely to be found, however, 
by studying personality differences of ther- 
apists. A notable current movement to unify 
theory and 


other suggests 


technique of psychotherapy is 
spearheaded by the work of Mowrer [6] in 
his attempt to relate psychotherapy to learn- 
ing theory. 

The thesis of this paper is that bases for 
greater unity should be found in the discovery 
of continuities among the common aspects of 
therapy some of which are emphasized by one 
school, others emphasized by other schools. 
Without attempting an exhaustive list, these 
common aspects may be partially detailed as 
follows: 


1. Techniques 


a. Interpretation 

b. Clarification and reflection of feel- 
ing 
Reassurance 

. Pressure and persuasion 
Questioning 
Free association 


~~ om 
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g- Hypnotic procedures 
h. Drugs 
2. Process and content’ (Typical categor- 

ies for classifying the actual verbal ex- 
changes between therapist and client. ) 

a. Understanding or insight 

b. Discussion of plans 

c. Answers to questions 

d. Rejection or acceptance of inter- 

pretation 


3. Relationship (Reference is made here 
to the relationship between therapist 
and client which may in part be des- 
cribed by the dimensions indicated. ) 


a. Balance of responsibility between 
therapist and client or patient. 

b. Degree of explicit and implicit 
freedom of response (limits) for 
patient. 

c. Degree of mutual empathies. 

d. Degree of mutual acceptance and 
rejection. 

e. Degree of transference and count- 
ertransference. 


Overlapping among the dimensions listed 
under 3 is inevitable; or, more quantitatively, 
most such dimensions would have tendencies to 
converge. It is not the purpose of this paper 
to pursue that aspect of the problem. 


4. Criteria of change? (Indicators of im- 
provement) 


a. DRQ as a measure of change in 
psychological tension [4] 

b. Insight 

c. Increased self-acceptance 

d. Reduction in hostilities 

e. Increases in range of social ac- 
tivities and reduction of old neu- 
rotic restrictions 

f. Increases in reality thinking and 
planning. 


Disagreements and special emphases have 
arisen principally with regard to concepts of 
technique and relationship. Bases of integration 
can, therefore, best be illustrated by reference 


1Cf. Snyder, W. U. [14] for more complete de- 
scription and listing; cf. also Seeman [13]. Sub- 
categories are illustrative without purporting com- 
pleteness. 

2See Curran [2]; cf. also Snyder [14] and See- 
man [13]. 


REX MADISON COLLIER 


to these two areas. The continuum concept* is 
thus far most easily illustrated with reference 
to the relationship between therapist and pa- 
tient. It is not difficult, for example, to see how 
the balance of responsibility for attacking and 
handling the problem can vary between the 
therapist and the patient. Likewise, it is not 
difficult to see that the other aspects of relation- 
ship may vary according to degree or dimen- 
sion. It is not quite so obvious how an aspect 
of technique may vary directionally and become 
the basis of resolving a dichotomy. 


THE INTERPRETATION CONTINUUM 


The Rogerian school has often contrasted 
interpretation with “reflection of feeling.” The 
one is referred to as directive; the other as non- 
directive. The hypothesis here proposed is that, 
if we can assume a continuum described as the 
degree of uncovering or revelatory interpreta- 
tion, then “reflection of feeling” will become a 
segment of the more comprehensive dimension. 
“Reflection of feeling” would, therefore, be a 
form of interpretation and could be rated in 
terms of the degree to which it appears to re- 
veal feelings, tendencies, aspects of personality 
heretofore unrecognized. Demonstration of 
this approach would be a step toward eliminat- 
ing both an unneeded dichotomy and undesir- 
able misrepresentations of psychoanalytic pro- 
cedure. Let the continuum be represented by 
the line with terminal points a and } (Fig- 
ure 1). 





Ficure 1 


a would represent the least degree of uncover- 
ing interpretation while 4 would represent the 
maximum degree of the variable. What seems 
to be steps along this continuum may be illus- 
trated by excerpts from verbatim published 
recordings.* Exactitude in identifying posi- 


Sef. E. S. Borpin, Dimensions of the counseling 
process. J. clin. Psychol., 1948, 4, 240-244. This 
significant article, presenting a somewhat similar 
line of thought, came to the writer’s attention after 
the present paper had heen accepted for publica- 
tion. Extended discussion, therefore, cannot be 
made at this time. 


4Experimental verification of these classificatory 
steps can be made by the method of Thurstone and 
Chave [16] in determining scale values for an at- 
titude scale. 
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tions on the scale or continuum is not the 
present purpose, but rather to indicate that 
continuity exists and the probability of ex- 
perimental definition of position. 


The least degree of uncovering interpreta- 
tion or the extreme left of the continuum 
would be simple repetition of the subject’s re- 
sponse. Examples of this may be found in Ax- 
line’s Play therapy [1, p. 103]. 

a. Jack: Look, I spit on my home.® 

Therapist: You spit on your home. 

b. Jack: I hate my father! I hate my brother! 


Therapist: You hate your father! You hate your 
brother. 


While the therapist’s purpose here includes ac- 
ceptance of the subject (note that several con- 
tinua may be involved in a single exchange be- 
tween therapist and subject) there is the prob- 
ability that the subject has been made slightly 
more aware of his own feelings and impulses. 
Since the technique is designed, in addition to 
being acceptive, to make the subject more 
aware of impulses and feelings which hereto- 
fore had escaped his recognition, the intent of 
the therapist is revelation even though it may 
be only in a mild degree. For purposes of this 
paper, the above example will be called Step I 
on this continuum. 


Some degree of advance, although probably 
small, in the direction of the opposite end of 
the continuum is represented by selecting a 
portion of the subject’s response and repeating 
it or restating it. The therapist uncovers here 
in the sense of pointing out or calling attention 
to what seems to him the more significant as- 
pect of the subject’s response. Examples of 
Step II are as follows: 


Richard: (who is going to see his mother) I'll 
get her something nice. I'll surprise her with a 
nice present. 

Therapist: You'll take her a nice surprise. 

Richard: Yes, I'll be gone two weeks. Maybe 
three. Gosh, it'll be good getting out of here. 

Therapist: You'll be glad to get away from here 
for a while. 

Richard: I'll be ten years old. I’m in the fourth 
grade. My dad wants me in the fifth grade, but 
I tell him I don’t want to be in the fifth grade. I 
like to fail. 

Therapist: You like to fail. [1, p. 123] 


5While attention is directed to the response of 
the therapist, the subject’s previous response must 
be included for context. 


These responses of the therapist are not 
simple repetitions, although some of them ap- 
proach the repetitive. The therapist has select- 
ed and restated. In the last exchange, the ther- 
apist could have selected other aspects of 
Richard’s response, e.g., 


a. Youll be ten years old. 
b. You’re in the fourth grade. 
c. Your dad wants you in the fifth grade. 


d. You tell your dad you don’t want to be in 
the fifth grade. 


The actual selection of the therapist is designed 
(in addition to being acceptive and permis- 
sive) to aid in recognition of a significant at- 
titude. There is, therefore, a greater degree 
of revelation than in simple repetition. 

A still further step (111) in the direction of 
uncovering the significant, latent or unverbal- 
ized aspects of the subject’s response is to re- 
state what the subject has said for the purpose 
of increased clarity or for the purpose of re- 
vealing (reflecting?) an affective tone or trend. 
The increase in clarity is directed toward 
what the therapist perceived to be an improved 
revelation of the problem as currently repre- 
sented. This is somewhat more than selecting 
an aspect of the subject’s response; it restates, 
summarizes or epitomizes for the purpose of 
revealing more clearly to the subject the core 
or implications of what he has said. Therapists 
can restrict themselves to Steps I and II and 
avoid the need for any very penetrating infer- 
ences, but beginning with Step III inferences 
that penetrate beneath the surface become in- 
creasingly apparent. Examples illustrating Step 
III are as follows: 


a. Subject: I never used to worry 
about people or what they thought. I didn’t mind 
what they thought about me or my personal af- 
fairs, until this last year. I didn’t think I was so 
bad until then. In fact, at that time I thought I 
was pretty darn goodlooking. Then I began to 
look at fellows and think, “Now if I had John’s 
nose, and Frank’s eyebrows, and Bud’s hair, and 
Harry’s chin, I really would look like something. 
Then I'd be a swell fellow.” 

Counselor: You sort of wanted to be the best 
of everything. [15, p. 104] 

b. Subject: The only thing is that I 
can’t help but feel that I work against myself. I 
mean it’s just so obvious to me that I do work 
against myself. And it seems rather silly to be 
one minute trying to do something about it, and 
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then it seems so certain that there’s something pres- 
ent that doesn’t want to do anything about it. 
(Pause) It’s such a contrary thing—the whole 
problem is just - - - 

Counselor: So that the very time that one part 
of you is perhaps really taking hold of this whole 
situation, another part of you is just sabotaging 
the whole business. [15, p. 173] 


Another step (IV ) in the direction of a great- 
er degree of revelatory interpretation is repre- 
sented when the therapist combines previous 
statements of the subject with current state- 
ments for the purpose of stimulating integra- 
tive insights. —The uncovering aspect of the 
therapist’s response is evident in his attempt 
to bring to a clearer conscious level personality 
trends that heretofore had been unrecognized. 
The published verbatim material representing 
“‘nondirective” technique shows few, if any, 
examples of this. Psychoanalytic material 
would contain a high frequency of this type of 
therapist response. Analyst responses, however, 
do not begin at this point, for they may be 
found to spread over the whole range of this 
continuum. Examples of Step IV are as fol- 
lows: 

Dr. Do remember had the same 
feeling in connection with mother, for whose health 
you were concerned, and that you behaved as if 


you were she? Do you remember that I reminded 
you of a statement of little Ralph 


Pt. I do 


Dr. . . . . who once said “If I don’t stop be- 
having so badly, it will kill her.’ Do you remem- 
ber you said it? 

Pt. That’s right. 

Dr. And that you had to run home to see wheth- 
er she still and that had a bad 
conscience as if you had done something wrong? 
Then you said something wrong means: something 
is wrong with your heart or you did something 
wrong. (Pause). Now, you said when your girl 
friend complained about her heart, you felt as if 
you were she. [3, pp. 132, 133] 


you that you 


were alive you 


In suggesting and illustrating hypothetical 
Step V, it should perhaps be pointed out again 
that the writer is not here attempting to indi- 
cate with any degree of precision these posi- 
tions. That remains for experimental deter- 
mination. Furthermore, we must avoid the im- 
plication at this time of equal distances be- 
tween these reference points. There is prob- 
ably greater distance, i.e., psychological dis- 
tance in terms of judges’ ratings, between III 
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and IV and between IV and V than between 
1 and II or even between I and III. The 
writer's’ hypothetical conception of this con- 
tinuum in terms of the foregoing and subse- 
quent illustrations is shown in Figure 2. 


> il Iv v 
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FIGURE 2 

The dotted extensions simply mean that the 
limits have not yet been explored nor defined. 
For example, at a there could conceivably be a 
mechanical repetition of precisely what the pa- 
tient had previously said. Experimental explor- 
ation of the effectiveness of this therapy has 
not been made, but it is a plausible procedure. 
At the other extreme 4, the work of Rosen 
[12] suggests that the dimension in extendable 
beyond even traditional analytic technique. 

The region of Step V is characterized by a 
deeper uncovering type of interpretation. The 
more elemental and the more deeply uncon- 
scious tendencies of the personality are here 
the objective of the therapist’s techniques. The 
uncovering of oral fixations, castration com- 
plexes, strongly rejected incest or homosexual 
tendencies represent the region of Step V. Ver- 
batim illustration of therapists’ procedures and 
verbalizations are difficult to find since analysts 
have only just begun to record their interviews. 
The following two excerpts from apparently 
verbatim material illustrate approximations to 
Step V. The first example would probably be 
classified somewhat to the left while the second 
might be somewhat closer to this last position. 


a. Dr. Without 
wouldn’t have 


this anxiety, probably 
heeded your aunt’s warning, 
you probably would have had sexual intercourse 
with Carl. From this point of view, the anxiety 
was a valuable help, because your sexual desire 
had destroyed your ability to resist your cousin. 
If your aunt’s warning had appeared without anxi- 
ety, you probably would not have had the power 
to interrupt your sexual play and to escape un- 
harmed. That was the situation at the time. To- 
day, however, sexual intercourse is no longer a 
danger. Now you may conceive a child without 
being blamed for it. But you still behave as if 
you were a little girlk Why? [5, pp. 59, 60] 


you 
and 


b. His voice was tired, as he said—“I’m still 
arraid of cats. ...” 

I drew a long breath. “Perhaps there’s a con- 
nection between this attempted suicide and your 
phobia. When you decided to die, you had to cross 
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the street to go to the river. Perhaps the crossing 
of a street may represent, in your Unconscious, the 
wish to leave home in order to die.. The fear you 
feel when you try to cross the street, would then 
be a repetition of that childhood fear when you 
were attacked by a cat. It would serve as the 
warning — 
“Don’t do it again!” [Ibid, p. 227] 


That such therapist responses can be classi- 
fied along a continuum such as the one de- 
scribed above implies neither their validity re- 
garding personality dynamics of the subject nor 
the appropriateness of the response. These 
questions are irrelevant to the immediate prob- 
lem of identifying these continua and their re- 
lational characteristics. It will be indicated 
later, however, that research on the effective- 
ness of therapy can be closely related to the 
dimensional concepts described here. At the 
moment it is important to point out that ac- 
ceptance of even the one continuum just de- 
scribed reduces the appropriateness of empha- 
sizing sharp demarcations between “non direc- 
tive” technique and analytic technique. With 
regard to the technique of interpretation identi- 
fied by this continuum, there is rather a gradual 
transition from one to the other with no nec- 
essary implication that the therapist becomes 
highly directive even at the opposite end of the 
continuum. Rogerian reflection of feeling may 
be represented as a segment near the a end of 
the continuum. To insist on limiting oneself 
to a particular segment of a dimension is, of 
course, possible but does not serve to make the 
technique distinctive. Nor is it desirable in the 
interests of fundamental clarity to insist on 
such sharp dichotomies as Rogerians frequent- 
ly do. 

If we are truly “client-centered”—and no 
therapy worth recognition is centered toward 
any other objective—then we will reduce our 
emphases on whether we are directive or non- 
directive, analytic, Meyerian, Adlerian, eclec- 
tic or otherwise; we will tend less toward be- 
ing protagonists of “doctrines” and give atten- 
tion to the needs of the client or patient. The 
hypothesis here submitted is that the needs vary 
from one client or patient to another and from 
one stage of therapy to another. Furthermore, 
in terms of these basic dimensions, the prob- 
lem becomes one of determining the segments 
of the dimensions which should be emphasized 
for particular kinds of patients or problems and 
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at successive stages of therapy. Rather than be- 
ing rigid about the pattern of segments to be 
used or emphasized because of doctrinaire ori- 
entation, therapy that is truly client-centered 
and not arbitrarily restricted should be sufh- 
ciently flexible so that adaptation to the im- 
mediate needs of the client may be made. ‘This 
suggested approach should provide a basis for 
unification of trends rather than for greater 
diversification. Sharp boundaries and dichoto- 
mies tend to blind one to the contributions of 
those whose results may have fundamental rel- 
evance to our own. It is pertinent here, also, to 
point out that no segment of such a continuum 
becomes necessarily the exclusive property of 
any particular group. 


THE RESPONSIBILITY-BALANCE CONTINUUM 


The limits of this paper prevent the discus- 
sion and illustration of more than one dimen- 
sion or continuum under each of the categories 
of technique and relationship. Reference is 
made now to the one deseribed as “balance of 
responsibility between therapist and client” for 
attacking the problem. The assumption that 
this responsibility balance is continuously vari- 
able obviates certain current confusions that be- 
set students in their training in psychotherapy. 
The writer suspects this confusion to involve 
even the experienced therapist at times. At one 
extreme the relationship is structured so that 
the patient is entirely responsible while at the 
other extreme the therapist becomes completely 
responsible. The latter is approximated when 
the therapist actively elicits a case history from 
a patient. The former may be observed near or 
at the termination of a therapy period when, 
with increased maturity and insight, the pa- 
tient plans his future activities. Again, it be- 
comes essentially a question of what segment of 
this continuum is appropriate for a particular 
case, for particular stages of therapy, and for 
special social fields*. Experimental procedures 
will eventually reveal the answers to such 
questions. 


While it is obvious that the case and therapy 
stage variables should affect this continuum, 
the implications of the social field within which 
therapy takes place are sometimes overlooked. 


6O. H. Mowrer, after reading this manuscript, 
has aptly suggested that the segment pattern would 
vary also during a particular therapy session. 
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Frames of reference are subtly derived from 
social fields and often complicate the problem 
of communication. One could approach the 
problem of therapy as essentially a problem 
in communication. It would then be necessary 
to concern oneself extensively with the frames 
of reference of both patient and therapist. 
These frames of reference are, of course, the 
psychological contexts out of which issue speci- 
fic verbalizations. The determination of the de- 
gree and frequency of coincidence of the frames 
of reference of therapist and patient might be 
particularly enlightening along with measures 
of therapy effectiveness and progress. 


Specific application of this point may be made 
to the procedure of structuring initial relation- 
ships in therapy. Much has been said and im- 
plied in the literature about proceeding so that 
your client feels that it is his responsibility to 
produce, that he doesn’t shift responsibility to 
the therapist for action and that the therapist 
must not behave as though he could solve an- 
other person’s problems for him. The appro- 
priateness of this approach will be found to 
vary with the social field. Consider patients in 
a hospital to be given psychotherapy. The coun- 
selor might say at the first interview something 
like this: “I can’t solve your problems for you. 
No person can solve another person’s problems 
for him. It is really up to you. Now, I’ll see 
you every day at ten o'clock and we can dis- 
cuss anything you like. Is there anything you 
would like to talk about today?”’ While this 
may be nondirective in the textbook sense, it is 
far too rigid an observance of rule to allow the 
therapist to begin by trying to understand the 
needs of the patient, why he came to the hos- 
pital and what he expects. 


What is a typical frame of reference of a 
patient coming to a hospital? Traditionally, 
one goes to a doctor or to a hospital to have 
something done to or for him. He gets a surgi- 
cal operation, a physical examination, recom- 
mendations concerning diet, activities, intake 
of medicines, etc. The patient comes to a hos- 
pital, generally, because his efforts have failed ; 
he comes as a last resort; if he could have help- 
ed himself, he would have done so and saved 
his time, his job and stayed among family and 
friends. With such a frame of reference, it 
must be a devastating blow to be told by one 


7Cf. Rogers [10]. 
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who represents the hospital, “I can’t help you; 
it’s up to you.” or some similar verbalization 
of nondirective structuring. From the patient’s 
point of view this is tantamount to a rejection 
of him and his problem. The therapist is more 
concerned here with being “correct” than in 
understanding his patient. 


Conflict about the directive-nondirective is- 
sue seems to be resolved by applying the con- 
cept of the responsibility-balance continuum. 
In the social field of the hospital, the therapist 
acts more appropriately if he adopts initially a 
mode of behavior and verbalization that repre- 
sents a rather high degree of therapist responsi- 
bility. In most instances the writer has not 
found it difficult gradually to shift this balance 
more and more to the patient as therapy pro- 
gresses and as the patient’s strength and confi- 
dence increase. Change with therapy is more 
similar to the growth of a child toward greater 
maturity than simply being analogous. The 
wise parent provides the child with opportu- 
nities to assume those responsibilities that are 
commensurate with his current capabilities. 
The wise therapist should behave in the same 
way. Effective therapy like effective child 
training eventually arrives at maturity and 
independence of the individual. With the pa- 
tient as with the child, too early an insistance 
on independence and self-direction is equiva- 
lent to rejection. With children, fear and hos- 
tility often result; with patients, it is not un- 
common for them to reject the institution or 
the therapist, or both, as inadequate. One 
may suspect that both children and patients 
have some justification on their side. 


With the concept of the responsibility-bal- 
ance continuum, it becomes less appropriate to 
emphasize dichotomies of directive and nondi- 
rective for this aspect of the therapy relation- 
ship. The problem becomes rather what seg- 
ment of the continuum is associated with thera- 
py progress under specified conditions. This 
may be subjected to experiment following ex- 
perimental definition of continuum positions. 
These positions could be illustrated by excerpts 
from verbatim recordings as was done in an 
earlier part of this paper. Such positions could 
also be experimentally identified by the Thur- 
stone technique to which reference has also 
been made. 

In a social field such as a college counseling 
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bureau, it would be possible to initiate therapy 
at a point on the responsibility-balance dimen- 
sion much farther in the direction of the client 
than if the therapy takes place in a hospital. 
Students seek help through counseling but gen- 
erally do not approach the service with as much 
dependence as patients in a hospital since they 
are not only still in the social field of the col- 
lege campus, but also have not found their own 
efforts at adjustment so completely futile. 


OUTLINE OF RESEARCH PROPOSAL 


On the basis of the foregoing discussion, the 
following research seems appropriate: 

1. Identification and initial generalized de- 
scription of dimensions or continua in the areas 
of relationship and technique. 

2. Development of dimensional scale values 
(Thurstone method) for therapist responses 
in successive sections (fourths, fifths or sixths) 
of transcribed completed series of cases. 

3. Analysis of content of patient responses 
in terms of criteria of change or progress. 

4. Graphic representation and correlation 
of scale-value change for the various continua 
with quantitative expressions of the criteria 
of progress. 

5. Final determination of the segments of 
continua of relationship and technique most ap- 
propriate with respect to (a) type of case or 
problem, (b) stage of therapy, and (c) re- 
lated social fields. 


SUMMARY 


A basis for integrative or convergent trends 
among the current disparate emphases in psy- 
chotherapy has been described. This basis is 
found in the proposed identification and quan- 
titative description of continua or dimensions 
in both technique and therapist-patient rela- 
tionships. Such continua are suggested as pro- 
viding a basis for including all the aspects of 
technique and relationship that have been 
found effective in psychotherapy. The continu- 
um concept is offered as a potential resolution 
of dichotomies such as “directive” and “non- 
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directive.” The needs of patients should final- 
ly be thought of in terms of the appropriate 
pattern of segments of these dimensions for 
particular cases, for particular stages of therapy 
and for variations in the social field within 
which therapy occurs. Concrete proposals for 
the application of the continuum concept to re- 
search in psychotherapy are offered. 


Received August 16, 1949. 
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CURIOUS situation has prevailed in 
the field of counseling in the last half- 
century. The client has stood in the focus of 
interest and, therefore, has been the center of 
research and discussion. The counselor was ac- 
cepted as a person who, equipped with proper 
skills and good personality and judgment, 
faced the client’s problems and attempted to 
deal with them. It was the psychoanalytical 
group which insisted that this was a miscon- 
ception and that the counselor, his personality 
and problems, were as much a part of therapy 
as the client. 

A survey of the literature of the last ten 
years reveals significant meagerness of publica- 
tions dealing with the problem of “the coun- 
selor in the therapeutic process.” 

Neo-orthodoxy in therapy, which is repre- 
sented by the nondirective approach, emphasiz- 
ing client-centeredness, repeats the assumption 
that the therapy situation primarily involves 
the client. The counselor creates an “objective, 
friendly, permissive atmosphere.” The problem 
of the counselor’s feelings, involvements, needs, 
and tensions is barely taken into account. In 
his last article [2], Rogers discusses at length 
what the counselor should feel and do. He is 
very taciturn as to how this can be achieved or 
whether it can be achieved at all. Wyatt [3] 
takes a very different view on the problem of 
the counselor. He sees the counselor gratify- 
ing needs in the counseling process as well as 
meeting personal problems. The counselor is 
as much involved, reactive, and situation-cen- 
tered as the client. No artificial distinction is 
made between the client, who needs objective 
permissiveness, and the counselor, who seems 
to face little difficulty disseminating these 
conditions. 

This writer agrees, to a considerable extent, 
with Wyatt’s concept of a counselor. Any re- 


a 
a 


flection or interpretation given to the client by 
the counselor is the result of a selective process. 
The counselor considers many possible ways 
of looking at what the client is saying, elimin- 
ates a number of them, and finally chooses one 
particular way of responding. This selection is 
made by the counselor; it is his choice which 
determines the nature of the reflection or in- 
terpretation. Any reflection or interpretation, 
whether accepted or rejected by the client, 
structures the client’s way of looking at himself 
and at others. If he accepts what the counselor 
has said, this means that he sees himself in the 
way in which the field has been structured for 
him. A rejection of the counselor’s interpreta- 
tion or reflection, however, does not necessarily 
indicate that the client’s point of view has not 
been changed. The mere fact that the client 
rejects a suggestion coming from the counselor 
has dynamic implications. It can be assumed 
that any activity by the counselor structures 
the client’s perceptions and, therefore, to some 
extent is directive, counselor-involved. 

The client has a social and personal rela- 
tionship with a counselor. He likes or dislikes 
his counselor according to his needs, and only 
rarely is neutral to him. The client perceives 
the counselor as a person, someone whose ut- 
terances carry weight—sometimes even the 
pain of insight. Because of this, the counselor 
must not perceive himself as an abstractum, 
thinking in terms of the client’s actions and re- 
actions and the significance they might have. 

During the interview the counselor is part 
of a social situation, which, like other social re- 
lationships, involves emotional interaction. The 
counselor affects the client in an individual 
way, a way which is characteristic of his own 
personality. The counselor is involved with the 
client simply because both are human beings 
obeying the laws of human interaction. 

The actions of a counselor with respect to a 
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client are two-fold: He influences the client 
consciously by giving reflections and interpre- 
tations, and he affects the client unconsciously 
with the weight of his own personality and 
personality needs. The nature of the reflections 
and interpretations is often characteristically 
influenced by the personality needs of the coun- 
selor. The questions then arise: How can the 
counselor guard himself against the danger of 
excessively projecting his own personality into 
the counseling process? What precautions can 
be taken against counselor involvement ? 

Some schools of therapy believe that it is 
imperative for every counselor to undergo a 
psychoanalysis before engaging in any kind of 
therapeutic work. Although the writer agrees 
generally with this assumption, the physical 
impossibility of translating this idea into real- 
ity must also be recognized. In addition, it is 
questionable whether a psychoanalysis is a pre- 
requisite to therapeutic work done on the coun- 
seling level. There is little doubt, however, but 
what workers dealing with milder adjustment 
difficulties have to guard themselves against in- 
volvement as strenuously as those dealing 
with complex and severe maladjustments. 

It is generally assumed that the recording 
and replaying of interviews accompanied by 
discussions with co-workers is one way of 
guarding against involvement. This kind of 
analysis makes it possible for the counselor to 
retrospectively become aware of his often gross 
projective tendencies. But it is during the time 
that the interview is actually taking place that 
the client is exposed to the falsifying factor of 
involvement. It is during the interview, when 
the important components of the therapeutic 
process are in the making, that the counselor 
needs ways of detecting and dealing with his 
involvement. Some possible ways in which the 
counselor might deal with this problem will be 
considered in the following paragraphs. 

During the interview the counselor, appre- 
hensive of his own as well as the client’s re- 
actions, should be aware of and familiar with 
two processes. First, he must recognize indica- 
tions of unconscious involvement and be aware 
of their appearances. Secondly, at various times 
during the interview he must attempt to draw 
back from the interactive process and determine 
the nature of his own functioning. It is sug- 
gested that this stepping-back process be called 
“disengagement.” It is the experience of this 
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writer that both of these processes can be 
learned and practiced successfully in the inter- 
view situation by any qualified counselor. 
When handled with proper skill and sensitivi- 
ty, these processes will not damage or inter- 
fere with the counseling interview. 


SIGNS OF COUNSELOR INVOLVEMENT 


In his interaction with the client, the coun- 
selor is often subject to obstructions from with- 
in himself. These obstructions are indications 
of counselor involvement. For example, the 
counselor finds his thoughts wandering away 
from the interview. It is difficult for him to 
concentrate, and he suddenly finds it hard to 
follow the client. Contrary to his previous ex- 
periences with this client, he finds that he can- 
not completely grasp the material which is be- 
ing presented. He experiences a lack of sensi- 
tivity toward the client. The counselor may 
think that he is not up to par on this occasion, 
but it is possible that these manifestations are 
indications of resistance to some of the materi- 
al under discussion. Perhaps some _ specific 
topic brought up by the client contained mater- 
ial which by association is problematic to the 
counselor himself. It is possible that the coun- 
selor is unaware of his own problem and 
therefore, is also unaware of the effects of the 
client’s statements upon him? There is, of 
course, a possibility that the client’s presenta- 
tion is actually confused. If, however, the 
dificulty which the counselor experiences in 
trying to comprehend and follow the client is 
a sudden one and is followed by disinterest 
and wandering thoughts, it seems very rea- 
sonable to believe that these manifestations 


are symptomatic of counselor involvement. 


Another similar phenomenon is the sudden 
appearance of unrelated thinking in the coun- 
selor’s awareness. He may have a sudden, per- 
sistent thought which seems to have no con- 
nection with the subject under discussion. For 
example, the counselor may experience an urge 
to walk out of the room, although there seems 
to be no obvious reason for this wish. Another 
instance might be the sudden appearance of 
images in thinking, the images being vivid, 
strong, strange, and unrelated to the interview 
in progress. The counselor may attempt to re- 
press these manifestations, but cannot control 
their reappearance. These phenomena are di- 








208 


rectly related to the counseling process; they 
indicate the active presence of repressions. The 
counselor is involved, but he is unaware of his 
involvement. 

At times the counselor may feel that the in- 
terview is not moving along as it should. Al- 
though the client is not especially reticent or 
inhibited, he gives little information. During 
the interview one impasse follows another. The 
counselor feels at a loss as to what to do next. 
Although he feels that he should do something, 
he lacks his usual, calm patience. Perhaps he is 
not aware of the strong feelings of insecurity 
he experiences in his relationship with a given 
client or during one particular interview. The 
sudden urge to be active in an interview, to di- 
rect it, may be an indication that the counselor 
is not detached. 


A counselor sees many clients on a certain 
day, and to his surprise, many of them seem 
to show similar problems with similar dynam- 
ics. He notices that the material brought forth 
that day by different clients lends itself to simi- 
lar interpretations; the adjustment difficulties 
seem to have a common pattern. This similari- 
ty of patterning and interpretation is often 
more than accidental. A careful review of the 
interviews with a co-worker often reveals a 
sensitivity of the counselor toward some speci- 
fic aspect of the clients’ problems rather an co- 
incidental similarity of problems. 


While the counselor is trained and often can 
recognize negative feelings toward a client, he 
is much less sensitive to his too positive feelings. 
If the client stimulates strong feeling of liking 
and sympathy in the counselor, these feelings 
frequently pass unnoticed. The counselor is not 
aware of this kind of involvement. The coun- 
selor may identify with the client in certain 
respects or he may be attracted to the client 
and develop strong emotional ties to him. 
Oversympathy and marked liking for a client 
falsify the counseling process as radically as 
hostility and dislike. Signs of overliking are 
often manifested in the counselor’s behavior by 
overeagerness, increased verbal participation, 
an urge to introduce himself into the conversa- 
tion, a sudden directiveness, and many others. 

A very real problem to many a counselor is 
the one of setting his own limits. Some coun- 
selors set their limits of therapeutic activities 
very low. They hesitate and avoid entering the 
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realm of what they label “deeper problems.” 
This avoidance of therapy under the arbitrary 
heading “deep problem” is often a sign of the 
counselor’s need to avoid involvement to any 
degree under all circumstances. Other counsel- 
ors have no hesitation to tackle any problem 
the client may bring, disregarding the factor 
of their own competence and preparedness. 
This, frequently, is indicative of the counsel- 
or’s need to get involved. Both instances indi- 
cate indirect counselor involvement in the 
counseling process. 

The recognition of “signs of involvement” 
is of little value to the counselor if he is unable 
to follow these signs through to an understand- 
ing of their meaning during the time that the 
rapidly changing interview is still in progress. 


DISENGAGEMENT 


In order to more objectively evaluate the 
dynamics of the interview in progress, the coun- 
selor should repeatedly attempt to ask and an- 
swer the following questions: Do I notice any 
signs of disturbance within myself? Am I 
functioning normally? Do I feel comfortable? 
In order to evaluate his own feelings and ac- 
tions in the light of the movement of the inter- 
view, it is necessary for the counselor to step 
back from the situation involving the client and 
himself. It is suggested that his step be called 
“disengagement” and that it be made a part of 
every counseling interview. 

Speed and intensity are two important fac- 
tors in the interview. Speed is defined here as 
the rapidity with which the client divulges in- 
formation or deals with it. Intensity is defined 
as a measure of the emotional involvement a 
client experiences while divulging or dealing 
with material. These factors frequently overlap 
in an interview. In every interview there are 
high and low points of interrelated speed and 
intensity. At a low point, the client seems to 
be at a standstill. At this time he may repeat 
material or restructure it. His feelings may be 
at a low pitch. Such points of low intensity 
and low speed permit the counselor to disen- 
gage himself without damaging his rapport 
with the client. At such times the counselor 
can take stock of himself and his feelings. He 
can review the material and determine the ade- 
quacy of the reflections and interpretations 
given to the client. It is possible for him to ex- 
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amine his own conduct and watch for signs of 
involvement. The counselor removes himself 
from the place of an active participant, but he 
must remain alert to the client’s reactions and 
sensitive to any sudden change in the situation. 


If the results of the self-inventory are nega- 
tive, it may be possible for the counselor to re- 
sume his activities with more awareness of the 
dynamics of the interview and with increased 
confidence. 

In cases where the counselor discovers signs 
of involvement, he may not be able to discover 
the deeper lying sources of his disturbance. 
Without special and prolonged training, this 
is frequently extremely difficult. He can, how- 
ever, take adequate precautions which will pre- 
vent damaging projections of these disturbances 
into the counseling process. The counselor, for 
example, may decide immediately to take a 
more reserved and cautious attitude toward the 
client. He may limit the quality and quantity 
of interpretations to an absolute and well-con- 
trolled minimum. He may desist from any step 
which would increase the speed and intensity 
of the interview. Last, but not least, the coun- 
selor may be alerted to the necessity to consult 
with associates in order to determine more 
clearly the nature, significance and the grav- 
ity of the disturbances. 


SUMMARY 


This writer assumes that the client and the 
counselor both face a new, unstructured, and 
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often difficult situation in the process of thera- 
py. Both personalities are involved. Both have 
needs to be gratified, adjustments to make. Re- 
search has given valuable information on some 
aspects of the client’s needs, feelings, and re- 
actions. Research now has to investigate the 
personality and the function of the counselor. 
The counselor and the client are made of the 
same material, and both are alike in many more 
ways than they are different. We to 
learn the properties and functions of the tool 
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called “‘the therapist” in order to understand 
more fully his effect on the client. 

The counselor in the counseling process can- 

not help but be affected at various times by the 
client’s reactions. It is important that these 
igns of involvement be recognized in order 
that projections of the counselor’s problems 
into the counseling process be forestalled. The 
process of disengagement is helpful in deter- 
mining counselor involvement. The recogni- 
tion of involvement enables the counselor to 
take precautions against projections. 
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AN INVESTIGATION OF NONDIRECTIVE GROUP 
THERAPY WITH STUDENTS IN 
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HE success of nondirective therapy in 

helping maladjusted college students to 
move toward better adjustment has directed 
attention to its possibilities for students with 
academic difficulties. It is suggested that these 
students might well need, for improvement in 
learning, a better understanding of themselves 
as well as training in the skills of study tech- 
nique. A study of the educational literature 
reveals a number of individuals who are in- 
vestigating the nondirective group therapy ap- 
proach to education. These studies have con- 
tributed much toward the early exploration of 
this seemingly profitable methodology. Among 
the recent writers who have presented studies 
in this area are Shedlin [7], Peres [5], Ba- 
ruch [1], Faw [2], Schwebel and Asch [6], 


and Landsman [3]. 


Arthur J. Shedlin [7] describes a course in 
which the instructor or counselor “attempted 
to conduct all sessions in a consistently permis- 
sive student-centered manner allowing the in- 
itiation and direction of the discussion to be 
completely determined by the values and in- 
terests of the group.” The results were re- 
ported as favorable with only one negative re- 
sponse on self-evaluation made at the termin- 
ation of the course. Shedlin [7] felt that the 
favorable results were due largely to the atti- 
tude of the instructor and the climate which 
he developed. 


One of the more detailed studies of non- 
directive therapy in the group form is the in- 


1This research was sponsored by the School of 
Education of Syracuse University and is adapted 
from a paper presented at the 1949 meeting of the 
American Psychological Association in Denver, 
Colo. before Division of Educational Psychology. 
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vestigation by Peres [5] which has analyzed 
the content more than the results of this 
process. Baruch [1] has used similar tech- 
niques with classes although she has concen- 
trated most of her study in the therapeutic re- 
sults with less attention paid to the more gen- 
eral educational values. Faw [2], and also 
Schwebel and Asch [6] report favorable re- 
sults in psychology classes in tests of subject 
matter and in subjective reports of class mem- 
bers. 


By contrasting a nondirective counseling 
class with a conventional mental hygiene class, 
Landsman reports that students reacted more 
favorably to the nondirective class with respect 
to personal growth, information, long term 
value, and other categories. Landsman’s [3] 
evaluation methods are essentially student- 
centered, lacking in objective criteria, and are 
primarily subjective reports of students which 
are analyzed quantitatively. While this is con- 
sistent with the educational methods employed, 
it departs to an extent from traditional evalu- 
ative methods. 


BACKGROUND AND PURPOSE OF THE STUDY 


For several years classes in Academic Me- 
thods have been held for Syracuse University 
students. Ihe students attending these classes 
were those who through pretests, or actual 
failure in academic work, demonstrated a lack 
of mastery of reading and study skills. The 
classes were conducted in a conventional man- 
ner. The instructors held both lecture and 
laboratory sessions during which students were 
taught to read more effectively and to apply 
well known study skills to their academic work. 
The continuous search for better classroom 
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methods in these Academic Methods courses 
prompted this study. This research is aimed at 
exploring the utility of nondirective group 
therapy for the Academic Methods classes. 

The emphasis of nondirective therapy is in 
the understanding and acceptance of the self 
or the self-concept. Some few modern authors, 
Lecky [4], Snygg and Combs [9], have hy- 
pothesized concerning the crucial importance 
of the student’s attitudes toward himself. They 
indicate that, at the root of many a student’s 
failure is not the inability to master study 
methods but a concept of self which is inade- 
quate. Snygg and Combs [9] stress the impor- 
tance of the self concept in all phases of ad- 
justment and specifically with regard to its 
position in determining the success or failure 
of a student. 


Sheldon [8] has added experimental evi- 
dence for this point in his study of forty col- 
lege students with academic difficulties. He 
found that the failing students studied ranged 
in intelligence from normal to very superior 
but that these forty individuals were severely 
disturbed emotionally. Sheldon has suggested 
that an intensive survey of the emotional stat- 
us of entering college students should be made. 
A better understanding of himself will, it is 
hypothesized, enable the failing college student 
to move toward a more adequate self concept 
or a better academic adjustment. The result- 
ing change in attitude should enable the stud- 
ent to improve not only in his academic work 
and social life but should lead to a greater all 
around adjustment. In order to investigate 
this thesis, the study was made. 


SELECTION OF SUBJECTS 


A group of twenty-eight freshmen, whose 
performance during their first semester of col- 
lege was below expectation, were invited to 
participate in the course in Academic Methods. 
According to the results of their entrance ex- 
aminations all of the students in the study were 
potentially capable of succeeding in college 
work. These students were divided into two 
classes using the matched pair technique. The 
students were paired roughly according to in- 
telligence as measured by the Ohio State Psy- 
chological Examination, general adjustment 
by the California Test of Personality, and 
reading ability with the Iowa Silent Reading 


Test, and grade point average. 

According to Table 1 the variances were 
homogeneous and there were no significant 
differences between the means of the two class- 
es for the California Test, the Ohio State Psy- 
chological Examination, and the grade point 
averages prior to the study. 


TABLE 1 
Tue ¢ VaLurs BeTween MEANS AND THE F RATIOS 
BETWEEN VARIANCES FOR Tests ADMINISTERED 
Berore EXPERIMENT 


Test df F t 


California Test of Personality 26 2.133 -6048 
Ohio State Psycho- 

logical Examination 26 
Iowa Silent Reading 

Test 26 1.295 .0909 
Grade Point Averages 26 1.3504 1.3793 


METHODS OF CONDUCTING CLASSES 


The twenty-eight students met three times a 
week in Academic Methods. The two groups 
met together each Monday and were lectured 
on such subjects as the use of time, compre- 
hension, and rate of reading, memorization, 
concentration, notetaking, and studying for ex- 
aminations. The lecture was followed by a 
question and answer period, demonstrations, 
experiments, and tests. At this joint weekly 
meeting the conventional method of instruction, 
lecture, and discussion was used. 


The total group was then divided into two 
separate groups which met two additional 
times each week. Group A, the control grovp, 
was taught in the conventional lecture and dis- 
cussion manner by the instructor of the joint 
sessions. The methods included lectures and 
directed group discussions by the instructor, re- 
quired reports made by students, supervised 
practice in reading and study skills. 


Group B, the experimental group, spent its 
two periods in nondirective group therapy 
sessions led by a competent nondirective thera- 
pist. Recordings were made of both groups 


throughout the semester using the Sound- 
Scriber. 


THE NONDIRECTIVE GROUP THERAPY SESSIONS 


Since there is practically no standardization 
of meaning for the term “nondirective group 
therapy,” it is advisable to clarify specifically 
the methods used in the class sessions. Em- 
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phasis is primarily placed upon an “atmo- 
sphere” which is created in the classroom situ- 
ation. This is both a difficult and limitless task 
since the atmosphere must be defined in terms 
of permissiveness perceived by each individual 
member of the group. The instructor care- 
fully attempted to remove the traditional 
classroom threats such as the authoritarian 
view of the instructor, the fear of criticism, and 
the necessity for meeting arbitrary standards. 

Classroom techniques rested largely upon 
the now traditional “reflection and acceptance 
of feeling.” It is to be emphasized that al- 
though even now the authors lean toward a 
broader and more liberal interpretation of 
nondirectivity in the classroom, at the time of 
the study a more rigid concept was adhered to. 
The direction of the techniques employed was 
clearly toward increased independence of the 
individual student. This is distinguished from 
the control situation in which students relied 
upon the instructor to provide the best tech- 
niques for studying. 

The participation of the instructor was pri- 
marily in (a) creating the atmosphere of ac- 
ceptance, permissiveness and warmth, (b) re- 
flecting feelings of students as expressed in the 
classroom discussion, (c) structuring the 
course verbally when needed. This includes 
“putting over” the point that the course would 
be different from others, the instructor would 
not lecture, etc., and (d) providing materials 
and equipment but only when specifically re- 
quested to do so by a student. 


Members of the nondirective class were not 
discouraged from dropping out of the course 
when they wanted to. A classroom situation 
where a number of students were working out 
crossword puzzles in the college newspaper 
was not unusual. At the same time, not un- 
usual were situations where the students were 
painstakingly discussing and working out prob- 
lems surrounding negative attitudes toward in- 
structors, feelings of deep inadequacy, fear of 
examinations, dissatisfaction with high school 
preparation, resistance to family “pushing,” 
and social and personal adjustments. 


Each member of both groups was asked to 
submit a written report or a reaction report 
once a week, containing his reaction to the 
methods used in his group. Individual inter- 
views were given to members of either group 
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upon request. A personal evaluation of the en- 
tire course was requested from each student 
at midsemester and end of the semester. 


TREATMENT OF THE DATA AND RESULTS 


At the close of the semester, all students in 
the study who could be contacted were retested 
with the California Test of Personality and 
the lowa Silent Reading Test. Grade point 
averages, the traditional college evaluating de- 
vice, were accumulated for the close of the 
semester during which the study was made. 
One year follow-up studies were made with 
respect to whether or not the students survived 
the academic storms which followed for them. 

The weekly reaction reports were analyzed 
for the purpose of detecting trends in the de- 
velopment of individual students or in each 
group as a unit. These reports were studied to- 
gether with the recordings which were made 
on corresponding dates in order to determine 
any existing relationships between the method 
of conducting the sessions and the reaction of 
the classes. 

As indicated in Table 2, variance ratios for 
the two groups were found to be homogeneous 
but no significant differences between the 
means were found after the experiment for 
either the California Test of Personality or the 
Iowa Silent Reading Test. 

However, variance ratios were homogeneous 
and a difference significant at the one per cent 
level of confidence was found for the academic 
grade point average immediately after the ex- 
periment in favor of the experimental or non- 
directive group. 


TABLE 2 
VARIANCE RATIOS AND ¢ Scores FOR TESTS AND 
Grapes FOLLOWING THE EXPERIMENT 











Test af F t 
California Test of Personality 141.78 00849 
Iowa Silent Reading Test 17 1.6296 1.0833 
Grades 20 


1.3081 6.5** 








**Difference significant at 1 per cent level in denen 
of the non-directive or experimental group. 

Control Group A (Traditional). Seven of 
the original fifteen students in Group A, the 
control group, dropped out of college, five be- 
cause of poor grades and two left of their own 
accord. These two received poorer grades in 
the fall term of 1947 than they had received 
at the close of the previous experimental term 
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TABLE 3 


SUMMARIES OF THE WEEKLY REACTION REPORTS OF 
Four Members or THe Controt Group 


STUDENTS 


Date A B 


3/3 I’m pleased with the 
classes and think I 
well enough to the shall benefit from them. 
present and the 

exercises have 


proved helpful. 


The course has 
been conducted 


3/7 We study so much 
for detail at the 
university that we 
know little about 
grasping concepts. 

3/14 No reaction. We ought to get started 
on some of the major 
topics which are to be 
covered. 

3/21 The course is now This morning’s session 
broadening con- was very interesting 
siderably. and helpful. 

3/28 No comment. None. 

4/23 Would like a dis- We should discuss how 


cussion of note 
taking on reading 
material which 
contains quan- 
tities of factual 
material. 


to prepare for exams. 


(Spring, 1947). There were eight students in 
the Control Group still in college at the close 
of the spring term of 1948. Five showed some 
improvement in one or both of the semesters 
following their attendance in the Academic 
Methods class and three showed a decrease in 
grade point average. 

Experimental Group B (Group Therapy). 
Three of the twelve dropped out of college.” 
Two dropped out at the close of spring, 1947, 
session and a third was dropped by the college 
at the close of the 1948 school year. Of the 


2Original group was 13, but one individual 
dropped the class immediately, leaving an N of 
12 for this group. 


e D 
As far as I see I 


not too much has been 
accomplished. 


find myself 
ing slowly but surely. 
If we had a five day 
schedule we'd progress 


more rapidly. 


can progress 


I personally think course 
is all right. I’m begin- 
ning to notice an im- 
provement in my com- 
prehension and reading 
rate, 
I’m rather well pleased 
with the course so far. I 
believe that my reading 
skill has improved 
slightly. 


So far I see I have a 


tendency to improve 


I find this one of my At 
most interesting and 
beneficial 


present no criticism 


courses. 


I haven’t progressed as 
rapidly lately. The dis- 
cussions are helpful but 
they don’t help my 
reading speed and 
comprehension. 


More reading tests. 


My rate has gone up 222 
with 100% comprehen- 
sion. I don’t think this 
improvement is very 
great and I’m still much 
too slow a reader. 


Would like more forced 
time rapid reading 
material. 


nine who remained in college for two semes- 
ters following the Academic Methods classes, 
three showed an increase in grade point aver- 
age, one had the same grade point average at 
the close of the second semester following the 
course, and the remaining five showed a de- 
crease. 


TRENDS 


An analysis of the reaction reports submit- 
ted by the students in the Academic Methods 
course revealed certain trends. The reaction of 
only four typical students from both the con- 
trol and experimental groups is presented in 
Tables 3 and 4. 

Control Group A. The majority of the stu- 
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TABLE 4 
SUMMARIES OF THE WEEKLY REACTION REPORTS OF 
Four MEMBERS OF THE EXPERIMENTAL GROUP 


STUDENTS 





Date A B e 

2/21 No value to us. So far nothing. I’m An intensely intrigu- 
Value for instruc- expecting bigger and ing varient from con- 
tor. better things. temporary school rituals. 


Class not adapted to 


WILLIAM D. SHELDON AND THEODORE LANDSMAN 


D 


Last two sessions a total 
waste of my time. My 
marks aren’t so good 
that I can waste a 





new environment. study period. 

3/3. Not as much tan- I am familiar with what Method of instruction 
gible work as I we are discussing. Noth- caused considerable 
expected. Help ing solid has been given friction in class, Stimu- 
received from us to stand on. lus to make me follow 
class discussion correct study habits is 
but would prefer lacking, I feel. 
full hour of work. 

3/7 My book has taught me _ Very little progress has 

90% of what I know. been made toward help- 
This course has slight ing the class. The course 
positive value. has great potentialities. 

/14 Class discussion a Recordings were amus- Animated discussion in Course seems to be 
lot better. Read- ing but helped only class. The future developing a little bet- 
ing test good, one person. promises. ter, but still seems 
would like more. to waste time. 

3/21 My roommates tell me Some light is dawning 

I’m studying more, but regarding my confusion 
I don’t believe it is the in the quest of educa- 
result of this course. tion. 

3/28 Examinations will No comment. Delightful material for The class seems to be 
help if continued today’s comprehension progressing more. If we 
every session. check. can have daily tests, I 

believe everyone 
will benefit. 
4/11 Reading tests I don’t believe my study Excellent relaxation The class has come to a 


show my work is 
average. Not 
much improve- 
ment, I feel it 
will take time. 


habits have improved. 
I’m studying more, but 
my marks 


dents in the Control Group seemed highly 
pleased by the way in which the course was 
conducted and felt that they were getting 
something definite from it. They were pleased 
with the reading tests and the positive direction 
the course was taking. These attitudes were 
still maintained at the middle of the semester. 
Toward the end of the sessions, however, there 
appeared in the majority of the reaction re- 


Most interesting item 
came up very late in 
class, 


standstill. A great deal 
of repetition and too 
much silence. 





ports a feeling of dissatisfaction and some ex- 
pression of insufficient progress. 

Experimental Group B. In the experimental 
group the entire class (with one exception) 
was initially antagonistic to the method of in- 
struction. They demanded specific information 
and definite directions and suggested the in- 
structor was lazy and evading all issues. They 
felt that they were wasting time and that the 


GROUP THERAPY WITH STUDENTS IN ACADEMIC DIFFICULTY 215 


views of their classmates could not contribute 
toward improving their methods of study. A 
glance at Table 4 will give the reader some 
idea of how strong some of the reactions were. 

About the middle of the semester one finds a 
slight change in the reactions of the students. 
Table 4 again indicates the change in feeling. 
In the entries for March 14 they state: “Class 
discussion a lot better,” “Course seems to be 
developing a little better,” and “Animated dis- 
cussion in class,” etc. Although they had not 
fully accepted the new method of instruction 
even at the end of the course, their comments 
indicated that they felt more kindly about it 
yet hesitated to admit that it had really been 
a help. 


SUMMARY 


A study is presented of twenty-eight stu- 
dents divided into two “Academic Methods” 
classes. One group is conducted in a primarily 
nondirective manner while the other group is 
conducted in the traditional lecture-discussion 
manner. Prior to the study the groups did not 
differ significantly with respect to test scores 
on the lowa Silent Reading Test, California 
Test of Personality, Ohio State Psychological 
Test, and grade point averages. Following the 
study, the groups did not differ significantly in 
Iowa Silent Reading Test scores and Califor- 
nia Test of Personality scores. The groups did 


differ significantly with respect to grade point 
averages at the conclusion of the semester in 
favor of the nondirective group. One year fol- 
low-up studies revealed that forty-seven per 
cent of the conventional group and twenty-five 
percent of the nondirective group had left 
school. 
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THE IMPLICATIONS OF CERTAIN ASSUMPTIONS IN- 
VOLVED IN THE USE OF THE THEMATIC 
APPERCEPTION TEST" 

J. R. WITTENBORN 


YALE UNIVERSITY 


HEN verbalizations produced in re- 

sponse to the TAT cards and other 
projective tests are used by clinicians as an evi- 
dence for important characteristics of the pa- 
tient, an employment of two assumptions is in- 
volved : 

I. That superficially similar productions 
sample traits or relatively fixed classes of the 
patient’s behavior. 

Il. That certain superficially dissimilar 
productions are “dynamically” related to each 
other. 


In addition to Assumptions I and II, projec- 
tive tests require an assumption which is im- 
plicit in all testing procedures, i.e. that the be- 
havior elicited by the tests is a sample of a kind 
of behavior which may to some degree be elicit- 
ed from the individual by numerous situations, 
including both the naural social situations in 
which behavior has a practical import and the 
artificial testing situations designed for the 
purpose of eliciting behavior under standard 
conditions only. Responses elicited by a test may 
be conceived as relevant to a universe of pos- 
sible responses elicitable by situations (includ- 
ing the test situation) which have certain com- 
mon response-eliciting properties. 

Assumption I, with its emphasis on sampling 
trait behavior, is implicit in two different in- 
terpretive practices in clinical psychology. The 
most common interpretive practice emphasizing 
a sampling of traits is the one wherein the 
clinician looks for repeated manifestations of a 
class of behavior to some degree characteristic 
of individuals in the society from which the 
patient is drawn. Some of these classes of be- 
havior, such as mental abilities, have been sub- 


1The writer wishes to acknowledge gratefully the 
clerical assistance of the Institute of Human Rela- 
tions, the U. S. Public Health Service, and Dr. 
Malcolm Williams of the U. S. Coast Guard 
Academy. 


jected to formal definition and systematic ex- 
ploration. Others, such as passivity, hostility, 
and conflict, are important in clinical jargon 
but ill-defined and unexplored. Perhaps, orig- 
inally these words were employed only to de- 
scribe the manner in which individuals reacted 
to specific situations. At present, however, these 
words are also commonly used to designate at- 
tributes which characterize individuals in a 
variety of situations and which are cor; dered 
to be more or less characteristic of all 1:.ivid- 
uals. 

Just as regularities in the behavior of groups 
of individuals have led to the use of attributes 
as an economical and uniform means of de- 
scribing them, other regularities which are 
uniquely characteristic of the behavior of a 
particular individual are also often ascribed 
considerable significance by the clinician’s in- 
tensive study of that individual. This second 
interpretive practice with its emphasis on the 
sampling of idiosyncratic behavior often leads 
the clinician to a consideration of features of 
the individual’s behavior which may comprise 
the nexus of his problem but which would not 
be discovered if only broad, culturally relevant 
traits were recognized. 


Assumption IT recognizes that psychologists 
commonly regard certain superficially dissimi- 
lar behavioral incidents or manifestations as 
functionally related to each other and accept 
them as an evidence for a broadly integrated 
pattern of behavior. The principles by which 
these different manifestations are bound to- 
gether are often referred to as behavioral dy- 
namics. The clinical psychologist who empha- 
sizes his awareness of the interrelationships be- 
tween symptoms or other superficially different 
behavioral manifestations is often said to be 
“dynamic” or “dynamically oriented.” This is 
particularly likely when he presumes to know 
the direction of influence and speaks of causal 
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factors (e.g. unconscious motivations). The 
rules or principles which descibe functional or 
dynamic relationships are for the most part 
not clearly stated and psychologists differ in 
the degree to which they employ various prin- 
ciples and in the manner in which they inter- 
pret them. As a matter of fact, the most 
thorough-going and prolifically described dy- 
namic behavior theory, the psychoanalytic theo- 
ry, is made up largely of words which are ill- 
defined and concepts which have uncertain im- 
plications. 

Psychoanalytic theory enables the clinician 
frequently to predict the kind of behavior 
which may be expected from a patient and al- 
so to specify types of behavior which are im- 
probable, but the manner in which the clinician 
does this is not rigorously deductive. In many 
important instances, the rules or principles ac- 
tually employed are either unanalyzed or un- 
verbalized so that one says, “I have a feeling 
that” more often than “I deduce from the 
foregoing facts and principles that. . .” Be- 
cause of this, attempts to establish the validity 
of dynamic principles may be more dependant 
on the intuition of the experimenter who 
states the hypotheses than on the adequacy of 
the theory to which the hypotheses purport to 
be relevant. 

For the purposes of the present discussion, 
it is proposed that three different types of con- 
siderations may be employed in rationalizing 
the practice of basing general statements con- 
cerning the individual’s adjustment on the 
manner in which he responds to the TAT 
cards. Specifically, (1) responses may be as- 
cribed a practical significance if they are mani- 
festations of a behavioral attribute of general 
interest ; (2) if they occur with such frequency 
in the productions of the individual as to sug- 
gest that they are uniquely characteristic of 
him; or (3) if they have a status with respect 
to some system of behavioral dynamics. Con- 
siderations (1) and (2) employ Assumption 
I whereas consideration (3) is based upon As- 
sumption IT. If the first type of consideration is 
validly employed and traits which may be used 
to distinguish between individuals in a group 
are directly manifested in responses to the test 
cards, it would be predicted that TAT re- 
sponses which express some of the most com- 
mon traits or attributes would occur with a 
discernible prevalence among the productions 
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of individuals. It would be predicted, more- 
over, that the manifestation of a given preva- 
lent form of response to one card would be 
correlated with a manifestation of that form 
of response occurring in other cards. If such a 
consistency of response tendency could not be 
discerned, one would be inclined to suspect 
that there was no trait-like consistency in the 
behavior of individuals with respect to the 
types of responses sampled by the different 
cards. 

The second type of consideration which may 
be employed in rationalizing the use of the 
TAT would not contribute to the prevalence 
of any form or forms of response. Moreover, 
the personally unique or idiosyncratic nature 
of the response would not contribute to inter- 
correlations among different manifestations of 
the type of response in question, i.e. a partic- 
ular manifestation of the type of response in 
question would not characterize more than one 
or two members of most samples, and as a con- 
sequence this type of response would not pro- 
vide a basis for pairing the individuals; accord- 
ingly, no correlation between various manifes- 
tations of a given mode of response would be 
possible. Although the second type of consid- 
eration is not a subject of the present analyses, 
it is feasible to test the validity of this type of 
consideration and such tests will provide the 
subject for a future report. 

If the development of and interrelations 
among behavioral manifestations are control- 
led by or describable in terms of a set of laws 
or principles of behavior which specify the 
relationship between the individual’s behav- 
ior and the conditions of his existence, it 
would be expected that similarities in the en- 
vironments of a sample of individuals would 
produce similarities between individuals with 
respect to their behavioral manifestations, in- 
cluding those elicited by projective tests. Pre- 
sumably, therefore, the third consideration 
would permit a classification of the responses 
of a group of individuals. The lawful nature 
of behavior is emphasized in the third consid- 
eration because the types of behavior which 
are lawfully interrelated may actually be sup- 
erficially dissimilar in their appearance and 
their interrelationships may not be congruent 
with any ordinary conception of behavioral 
traits or attributes. Although the trait concept 
may be most readily applied to the similarities 
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involved in the first and second considerations, 
it is not implied that the types of behavior in- 
volved in the first and second considerations 
are lawless or unpredictable in their develop- 
ment. 


Although a direct validation of the clinical 
use of the TAT may be readily conceived, such 
direct validating procedures are difficult to 
execute because they require the development 
of criteria, a task which is ordinarily so lengthy 
and expensive as to make desirable prelimin- 
ary studies which indirectly explore the prob- 
able validity or invalidity of plausible assump- 
tions. The purpose of the present report is to 
provide an evaluation of two hypotheses which 
are generated primarily by Assumptions I and 
II, respectively : 

Hypothesis A: Responses to the TAT which 
are similar to each other with respect to class 
of behavior involved will be significantly re- 
lated to each other more frequently than the 
responses among an unselected group are re- 
lated to each other. 

Hypothesis B: Responses for which a func- 
tional or dynamically determined complement- 
ary response may be specified will be signifi- 
cantly related with their complementary re- 
responses more frequently than unselected re- 
sponses will be related to each other. 


Inasmuch as the interpretation of Hypothe- 
ses A and B is not a completely objective mat- 
ter, it is possible that the outcome of an analy- 
sis such as the present one is influenced by the 
responses selected by the investigator. The re- 
sponse selections themselves may be described 
fairly well, however, and it seems probable 
that the general procedures employed may be 
made sufficiently clear to permit an effective 
reproduction of the conditions of the present 
analysis. 

It is emphasized that it is not the purpose 
of the present study to test any particular dy- 
namic theory or its implications. It is intended 
to examine only certain restricted consequen- 
ces of Assumptions I and IT. These assumptions 
are considered by the writer to be implicit in 
the ordinary use and interpretation of the 
TAT, projective tests in general, and other 
clinical material. The method of analysis em- 
ployed in the present study could, however, be 
adapted to the purpose of testing specific in- 
terpretations generated by psychodynamic the- 
ories of human behavior. As a matter of fact, 


in testing Hypothesis B, the specific response 
groups chosen for the statistical tests will have 
a greater relevance to some dynamic principles 
than to others. No serious attempt will be 
made, however, to identify the theoretical 
status of any of the response groups. 


SUBJECTS AND PROCEDURES 


The subjects who provided the necessary 
data for the present analysis were a group of 
100 Yale undergraduates who consulted the 
writer voluntarily. The nature of the prob- 
lems presented and the therapeutic procedures 
employed varied widely. At least one-half of 
the students could be described as chronically 
neurotic, but only a very small fraction of 
them had at any time experienced acute epi- 
sodes or presented psychotic features. 

It is quite possible that the sample of sub- 
jects providing the data may determine the 
relative status of Hypotheses A and B. For 
example, it is possible that the so-called trait 
behavior with its stereotyped manifestations be- 
comes more conspicuously characteristic of 
patients as their illness becomes more severe. 
As a matter of fact, it does not sound unreason- 
able to say that for the most part, broadly gen- 
eralized patterns of reaction characterize the 
behavior of psychotics, and a lack of discrimin- 
ating behavior is an outstanding characteristic 
of severe psychopathology. In most, but cer- 
tainly not all cases, it also seems characteristic 
that the severely affected patient’s behavior is 
overgeneralized in such a manner as to obscure 
the functional interrelationships. between his 
different behavioral manifestations and, as a 
consequence, a verifiable dynamic formulation 
of his behavior is often tediously protracted if 
not infeasible. Among patients such as those 
comprising the present sample, who for the 
most part are moderately neurotic or are 
troubled with transient neurotic symptoms, the 
types of superficial similarities in behavior 
which lead to broad descriptive classifications 
or trait designations ‘are not arrestingly con- 
spicuous, and functional interrelationships 
among many superficially different behavioral 
manifestations seem to obtrude themselves on 
the therapist in a manner which is less obvious 
among most psychotic patients. Although the 
foregoing pattern of speculation suggests that 
the present sample favors the verification of 
Hypothesis B more than Hypothesis A, this 
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is certainly not a matter of fact and may not 
be verifiable. Nevertheless, it is urged that the 
results of the present analysis be generalized 
only to nonpsychotic samples of intelligent 
young men who for the most part are free from 
severe psychopathology. 

The response categories employed in a study 
such as the present one may be expected to 
influence greatly the outcome of the analytical 
procedures. Inasmuch as the purpose of the in- 
vestigation was to explore the implications in- 
volved in the clinical use of the TAT, it was 
decided to employ categories which for the 
most part were clinically relevant to problems 
presented by the subjects. Due to the nature 
of the statistical analyses to be employed, it was 
decided to use only the forms of responses 
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most prevalently provided by the sample of 
subjects. Inasmuch as the manifestations of 
different forms of response were to be inter- 
correlated with each other, it was imperative 
that a form of response employed be relatively 
common, easy to define, readily recognized, and 
not likely to be confused with some other re- 
sponse form. These limitations exclude many 
clinically useful and theoretically important 
forms of response from the present analysis. 
Several systems for classifying TAT respon- 
ses are described in the literature. For the 
most part, however, these groupings do not en- 
joy a widespread clinical use and, as a matter 
of fact, many of them comprise such particular- 
ized aspects of the patient’s responses that the 
clinically significant characteristics of the re- 


TABLE 1 


Card Figures Role Ascribed in Response 


7 BM ; Boy Impetuous.......... 
Compliant.......... 





Man Objective and sympathetic......... 


7 GF Girl Accepting guidance............ 


ES Sa ae 


Mother I ccnctiteaiiorninins 


Ee ccnncstcipnitinniinitaiacnns 


Man Hostile fighter.. 


Conflict 


Good. 


Woman 


6 GF Man Friendly 


Suspicious............ 





Preying............. 


Woman | Ee nee nee 


SE oe nD 


6 BM Mother 


Man 


Sad, shocked, unrealistic.......... 


pe ee 
Confessing guilt....................... 
Emancipation................-....... 


Eternal triangle............................ 


| ES Eee . 


oo eee 


9 BM 


13 MF Woman 


Man 
Guilty over neglect.................... 


EE ee 


NE a ES ee 
SINE ciahabttsoeestiimninaiininabibdovatitcenpeesdeipinipinmiapineaiannidaiie 





GebDeey OVER VERIO CO. ncccncccccecccvstccenscerscccsecnccssessecesses 





A CLASSIFICATION OF THE RESPONSE OF ONE HUNDRED StTuDENTs TO CerTAIN TAT CArps 


Response 
Group 
Number Frequency 
1 18 
2 28 
3 18 
4 60 
5 35 
6 27 
1 20 
& 51 
samnainien 9 36 
10 26 
11 26 
12 67 
13 33 
14 22 
15 34 
16 24 
17 66 
18 68 
hiniicitsiiiinipeminadeens 19 45 
20 17 
—_ _ 21 26 
22 20 
23 20 
24 21 
25 49 
26 35 
27 40 
28 19 
29 26 
30 24 
$1 29 
32 18 











220 


sponses are fragmented. Accordingly, the at- 
tractive possibility of using a generally known 
and standard response classification was fore- 
gone for the sake of employing a response clas- 
sification which seemed clinically relevant to 
the problems of the patients who provided the 
data. The response categories here employed 
were selected from a group mentioned in an 
earlier publication [2]. Frequency of appear- 
ance, not relevance to the hypotheses here pre- 
sented, was the primary basis of the selection. 
The response groupings were not based on the- 
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oretical considerations but comprise only a 
convenient grouping of prevalent, clinically 
relevant response themes. 


The foregoing factors led to an employment of 
the cards and response themes which comprise the 
Table 1. The productions were se- 
cured by a self-administering TAT procedure 
which has been described elsewhere [2]. Some of 
the response classifications given in Table 1 re 
quire explanation. for Card 7GF 
the mother described as pressing (response number 
7) may be demanding love from the child, protest- 
ing and self-sacrifice, 


list shown in 


For example, 


her own love demanding 


TABLE 2 
AN INDICATION OF THE Response Pairs Wuicu Are ReLATep AT THE 10 Per Cent Levet or 


Response 
Group 
Num- 
ber 1 2 3 4 5 6 7 


2 


ow rf 


4 (-)05 10 

5 - (—)10 

5 (—)01 

7 01 
)1 (-)01 


wD 
= 
_— 
a | 


11 (-)10 


23 ~- 05 10 





Response 
Group 
Num- 





CONFIDENCE (SIGN OF RELATIONSHIP INDICATED IN PARENTHESIS) 


10 . (-)05 ( 
10 - 01 


)05 01 
(—)01 


10 - (-)10 10 


(-)10 10 10 


05 10 10 
- - 10 — = 


(-)05 10 
(-)10 10 (-—)10 


— : (-)10 


- - (—)10 
-— -- -— 01 
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TABLE 2 (contTinvep) 


‘Response 

Group 

Num- 

ber 17 18 19 20 21 22 23 


oe ow ec fe Sw 


~ 


~ = 
ol 


~~ 
oe. @ 


— 
<a 


= 
oe 
' 


ws 
oc 


(—)10 . / 


~ 


te 


eo ew te 
oe te 


~ 
& | 
a 
= 
' 
— 
~ 
> | 
~ 


(—)10 
25 (-)10 10 _ 
26 10 . - ~- ~- 


27 — 10 . ~ - 
(-)05 10 — ~ -- 
30 (-)10 ~- — 


31 : -- 
32 — — —_ _ 


- _ / 
- -_ 10 / 


| 
_ 


)05 05 (-)10 7. 10 
(—)10 01 ( )05 / 





Response 
Group 17 18 19 20 21 22 23 
Num- 

ber 

achievement, or in some uninvited way attempt- 
ing to structure the child’s reaction. The man who 
is seen as a hostile fighter (number 9) in response 
to Card 4 may be about to fight or wishing to 
fight but deterred by the woman or other features 
of the situation. For this card, the man in con- 
flict (number 10) is uncertain of what he wishes 
to do; usually, the conflict is concerned with his re- 
action toward the woman. The good woman 
(number 12) in response to Card 4 like the good 
women (response numbers 17 and 27) in response 
to cards 6GF and 13MF is loyal, sexually moral, 
and not destructive in her behavior or attitudes. 
The preying man (number 15) seen in response 
to Card 6GF may be a blackmailer, a sexual ad- 
venturer, or distressing the woman without justi- 


— (-—)05 - 10 / / 


24 25 26 27 28 29 30 $1 32 


fication. The suspicious man (number 14) may be 
a detective, a cuckold, or a niggardly, invidious or 
irritable individual. The conflict response category 
(number 23) for Card 2 invariably refers to con- 
flict, uncertainty, or ambivalence on the part of 
the woman in the foreground. The worker cate- 
gory (response number 25) for Card 9BM refers 
to themes of any kind wherein the reclining men 
are seen to have just participated in or to be de- 
servedly resting momentarily from some construc- 
tive, nonpunitive task. The idler theme (number 
26) is concerned with stories where the men have 
no real purpose or their purpose is to shirk their 
responsibilities or obligations for work. Neither of 
these categories includes fugitive or pursuers. The 
sex conflict of the man seen in Card 13MF may be 
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moral regret for a sexual act or uncertainty in the 
presence of sexual temptation. This response does 
not include repugnance growing out of the physi- 
cal features of the situation or any violent or de- 
structive behavior which occurred in the situation. 


The statistical significance of the relation- 
ship between each of the response themes and 
every other one was determined by means of 
the 7? test for independence. The significance 
of all of the important relationships is shown 
in Table 2. There are 491 possible interrela- 
tionships among the 32 response themes listed 
in Table 1. The alternative roles or themes for 
any given character are mutually exclusive. Ac- 
cordingly, the 24 possible interrelationships 
among mutually exclusive responses or themes 
are eliminated from consideration so that only 
467 of the interrelationships are computed in 
the present analysis. 

Both Hypotheses A and B require that the 
number of significant relationships among re- 
sponse categories designated as specifically rele- 
vant to a particular hypothesis be compared 
with the number of significant relationships 
among “unselected” categories. For the pur- 
pose of making this comparison, relationships 
which are “unselected” are defined? as all of 
the relationships between themes or response 
categories from different cards. There are 438 
such relationships given in Table 2, and 21 per 
cent of these relationships between so-called un- 
selected response or theme groups are signifi- 
cant at the 10 per cent level of confidence. This 
is twice the number of significant relationships 
that would be predicted on the basis of chance 
alone. The fact that the various response cate- 
gories or themes are significantly related with 
each other may incidentally be taken as evidence 
that the various themes may be reliably identi- 
fied. If no significant relationships had been 
found, the data would have been meaningless 
until systematic independent tests of the relia- 
bility of the response classifications had been 
conducted. In general, a lack of significant re- 
lationships under such circumstances could be 
due either to a possible intrinsic lack of rela- 
tionship or to the possible unreliability of the 
response classifications. Fortunately, in this 
study the presence of a goodly number of sig- 
nificant relationships makes a systematic evalu- 

2Other means of defining unselected responses are 


possible, but no reasonable alternative would have 
changed appreciably the results of these analyses. 
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ation of the reliability of the response classifi- 
cation unnecessary. 


RESULTS 


Three separate tests of Hypothesis A were 
attempted. For cards 4, 6GF, and 13MF a 
pattern of response which identifies the woman 
as “good” is commonly given. Patients who see 
the woman as playing a good role in all these 
three situations are frequently described by 
clinicians as inclined to regard women as good 
or to interpret the results in some way equally 
indicative of a tendency to regard each of these 
three responses as behaviorally similar. Actual- 
ly, none of these responses is significantly re- 
lated with either of the others. It is obvious, 
therefore, that Hypothesis A does not apply to 
the “good woman” response category, i.e. the 
meaning of “good woman” is not the same 
when elicited by different cards and any inter- 
pretive practice which assumes their similarity 
is unsupported by these data.° 


Clinicians often speak of conflict as if the word 
referred to an important, broadly general behavior 
characteristic of the individual. At times 
clinicians are more inclined to speak of the “se- 
verity of the conflict” or the prevalence of “evi- 
dence for conflict” than to specify the circumstance 
which elicits conflicting reactions or to emphasize 
the nature of the conflicting reactions. Conflict 1s 
commonly and clearly ascribed by patients to the 
heroes for Cards 2, 4, 7BM, and 13MF (response 
group numbers 3, 10, 23, and 32). 
only two of the six possible interrelationships 
among these conflict themes are significantly re- 
lated at the 10 per cent level of confidence. Al- 
though this fraction of significant interrelation- 
ships may be slightly greater than the fraction of 
significant interrelations found among the unse- 
lected group of responses, there seems to be little 
justification for regarding these data as evidence 
for the appropriateness of Hypothesis A with re- 
spect to the TAT responses classified as conflict 
responses. Whether a state of conflict is as highly 
generalized as many clinicians presume cannot be 
answered from these data, but the data do not 
support an interpretation of the present conflict re- 
sponse themes as being behaviorally similar or as 
having a common implication for the adjustment 
of the patient. 

Clinicians commonly examine test protocols for 
evidence of hostility and if they find one or two 
apparent instances of hostile actions or motives in 
the projective test protocols, they often speak of 
mild or moderate evidences of hostility and state 


some 


Nevertheless, 


8These responses do show significant relation- 
ships with other responses, however. 
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or imply that the patient entertains mild hostile 
feelings. If, however, reference to hostile or de- 
structive behavior or attitudes occurs frequently 
in a test record, they may say that the record is 
full of hostility and claim that the patient is a 
very hostile person. In such instances, it is evi- 
dent that the same kind of interpretation is given 
to a variety of responses which have hostile, rebei- 
lious, unfriendly, or in some other respects de- 
structive associations. In the present series of re- 
sponses, there are several which would often be 
ascribed a hostile significance. Among them would 
be the impetuous boy seen in Card 7BM, the re- 
sistant girl seen in Card 7GF, the hostile man 
seen in Card 4, the preying man seen in Card 
6GF, the guilt-laden man seen in Card 6BM, and 
the man guilty over violence seen in Card 13MF. 
Nevertheless, only two of the 15 interrelations 
among these six response classes (numbers 1, 6, 
9, 15, 20, 30) are significant at the 10 per cent 
level of confidence. Thus, significant interrelations 
among response themes which presumably are pro- 
jections of hostile feelings are no more prevalent 
than they are among unselected themes. This neg- 
ative finding certainly cannot be construed as sup- 
port for Hypothesis A. 


Although the results of these three simple 
analyses cannot be taken as an indication that 
evidence for Hypothesis A cannot be found 
among data such as these, they certainly do re- 
quire the observation of considerable caution 
in ascribing a common meaning to superficial 
similarities among responses to different TAT 
cards. 


In organizing specific analyses appropriate 
to Hypothesis B, groups of responses were se- 
lected by means of an informal employment of 
the following guiding principles: 


a. Their correlated appearance should be 
acceptable as a plausible partial expression of 
some dynamic principle or tendency. 


b. Their correlated appearance should al- 
so be acceptable as an expression of a pattern 
of motivation, conscious or unconscious, fre- 
quently observed in samples of patients such as 
the present one. 


For example, a young man whose behavior is 
impetuous or rebellious in the presence of an older 
man is not likely to see such older men as objec- 
tive or sympathetic. On the contrary, it may be 
predicted that he would tend to regard older men 
as motivated by a desire to punish, exploit, or in 
some manner prey upon relatively vulnerable in- 
dividuals such as women or young men. The pat- 
tern of relationships among responses 1, 4, and 15 
conforms with this prediction. An inverse employ- 


ment of the same considerations would result in a 
prediction that young men who are compliant in 
the presence of older men tend also to regard older 
men as objective and sympathetic and would not 
be inclined to regard men as preying on or ex- 
ploiting women. The relationships among re- 
sponses 2, 4, and 15 confirm this prediction. 

The foregoing example may be accepted by 
some readers as a plausible interpretation of cer- 
tain features of the Oedipal situation. Another 
feature of this situation leads to a second pattern 
of prediction. In response to Card 13MF with its 
varied sexual implications, 18 of the students 
ascribed’ a severe ethical sexual conflict to the man. 
It would be predicted in general that young men 
who are strongly tempted, but nevertheless guilt- 
laden and uncertain, in a sexually stimulating sit- 
uation, would be inclined not to recognize the sex- 
ual motivations of the woman but to regard het 
as innocent. Furthermore, it would be predicted 
that young men who are uncertain of their own 
sexual rights and unable to acknowledge the sex- 
ual requirements of women would tend to see older 
men as preying on women and not as objective 
and sympathetic. The interrelationships among re- 
sponses 32, 29, 4, and 15 confirm this prediction. 

In response to Card 6BM, 26 of the students saw 
the young man emancipating from the home and 
the domination of the mother. Whenever the drive 
toward emancipation and independence is suffici- 
ently strong in youths, they are observed to be 
more rebellious than compliant, and the home is 
not regarded by them as a tranquil place where 
all members are satisfied working toward a com- 
mon goal. The pattern of intercorrelations (among 
response numbers 1, 2, 21, and 24) is congruent 
with this speculation. 


The three analyses relevant to Hypothesis B 
tend to conform with the predictions, whereas 
the analyses relevant to Hypothesis A offer no 
support for that hypothesis. As already empha- 
sized, however, the results of the analyses are 
dependent upon the relationships examined and 
the reader may scrutinize Table 2 in order to 
satisfy himself concerning the relative status 
of Hypotheses A and B. An element of dynam- 
ic plausibility may be “read into” a number of 
the interrelationships. Although it would be 
extremely hazardous to ascribe any general 
significance to such inferences, they could, of 
course, be stated formally and another sample 
of data gathered and analyzed for the purpose 
of examining the validity of any inferences that 
the present data suggest. 


Although the present analyses suggest that 
TAT interpretations based upon a trait con- 
cept of behavior should be employed with con- 
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siderable caution, the possibility of a profitable 
use of trait concepts is by no means excluded 
by these limited data and restricted analyses. 
The analyses do strongly suggest that the most 
consistent interrelationships among ‘TAT ma- 
terial provided by students can best be under- 
stood through an employment of behavioral dy- 
namics. ‘Che manner in which the present data 
challenge the descriptive trait approach to the 
study of behavior and direct our attention to 
the importance of the motivations served by 
the behavior is reminiscent of some of the 
classical studies of Hartshorne and May [1]. 

The present analyses provide an interesting 
methodological exploration. Too many psy- 
chologists have resolutely claimed that dynamic 
material cannot be analyzed by statistical 
means and that such quantitative statistical 
analyses are necessarily static and invariably 
antithetical to a dynamic conceptualization of 
behavior. It may indeed prove possible to con- 
duct systematic, quantitative explorations of 
important dynamic interrelationships found in 
clinical material through an employment of 
very simple statistical procedures. 


SUMMARY AND CONCLUSIONS 


The present study comprises a statistical 
evaluation of data relevant to the following 
hypotheses : 


Hypothesis A: Responses (to the TAT 
cards) which are similar to each other with re- 
spect to class of behavior involved will be sig- 
nificantly related to each other more frequent- 
ly than the responses among an unselected 
group will be related to each other. 


Hypothesis B: Responses for which a func- 
tional or dynamically determined complement- 
ary response may be specified will be signifi- 
cantly related with their complementary re- 
sponses more frequently than unselected re- 
sponses will be related to each other. 


The data comprise a categorization of TAT 
productions provided by 100 Yale undergrad- 
uates in response to 8 selected cards. For the 
most part the response categorizations were 
based upon the roles commonly ascribed by the 
students to the figures appearing in each card. 
For this reason relatively few response cate- 
gories were common to two or more cards. 

The results of the analyses may be summar- 


J. R. WITTEN BORN 


ized: 

1. Asa means of evaluating Hypothesis A, 
three different sets of intercorrelations among 
superficially similar response categories were 
examined. In no one of these sets was there a 
discernible tendency for superficially similar 
response categories to be consistently interre- 
lated. ‘The failure of the present data to pro- 
vide evidence in support of Hypothesis A sug- 
gests that clinicians should be cautious in as- 
cribing a trait significance to the multiple ap- 
pearance of TAT response roles or themes 
which are superficially similar to each other. 
The relative paucity of significant interrela- 
tionships within the sets of response categories 
relevant to Hypothesis A does not preclude the 
possibility that some other superficially similar 
response categories would be positively interre- 
lated and could thereby provide support for a 
trait emphasis in the clinical use of the TAT 
productions. The failure of the present data 
to provide evidence for Hypothesis A may 
mean, however, that the psychological signifi- 
cance of a given response is contingent upon 
the context in which the response is elicited and 
that superficially similar responses to different 
TAT cards would not be behaviorally similar 
unless the cards in question were highly simi- 
lar in their general response eliciting proper- 
ties, i. e. had in general a similar total meaning 
for a large number of subjects. 


2. Three sets of intercorrelations were ex- 


amined as a means of evaluating Hypothesis B. 
The response categories comprising each of 
these sets were superficially dissimilar to each 
other, but were considered to be related with 
each other in a manner consistent with a dy- 
namic interpretation of behavior, e.g. the re- 
sponse categories within a set could be con- 
ceived as a partial expression of a single pat- 
tern of motivation, conscious or unconscious. 
The response categories within a set were in- 
tercorrelated in a fashion consistent with the 
requirements of Hypothesis B. There are sev- 
eral specific respects in which this is an impor- 
tant finding: 


a. It suggests that the responses elicited 
by projective test devices may be lawfully in- 
terrelated in a manner consistent with a theory 
of personality. 


b. It suggests that clinicians may profit 
from a self-conscious and deliberate use of per- 
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sonality theory in interpreting projective test 
responses. 


c. It suggests that the intercorrelations 
among different categories of responses to pro- 
jective test material may be used as a method 
for testing the relative validity of alternative 
hypotheses in a personality theory. 


It is important that the results of a study 
such as the present one not be overgeneralized 
and not be incautiously applied to untested 
situations. ‘The relative status of Hypotheses A 
and B may vary with the patient group, with 


the manner in which responses are categorized, 
and with the relative pertinence to Hypothesis 
A or B that a given investigator ascribes to a 
particular response category. 
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OBJECTIVE RORSCHACH RESPONSE PATTERNS 
DIFFERENTIATING ANXIETY REACTIONS FROM 
OBSESSIVE-COMPULSIVE REACTIONS 
SOLIS L. KATES' 


MICHIGAN STATE COLLEGI 


HE purpose of this investigation was to 

differentiate between the Rorschach test 
responses of anxiety reaction patients and those 
of obsessive-compuisive patients who were seek- 
ing assistance at a mental hygiene clinic. It was 
also intended to examine the significance and 
meaning of certain Rorschach response cate- 
gories in accordance with their capacity to 
discriminate between anxiety reactions and ob- 
sessive-compulsive reactions. 

The patient with an anxiety reaction dis- 
order is suffering primarily from many unre- 
lieved tensions that contribute to his restless- 
ness and that interfere with adequate satisfac- 
tion from his achievements. His anxiety is dif- 
fuse, is not restricted to specific situations, and 
is not bound by any psychological defense 
mechanisms [10]. Both psychological and 
physiological aspects of anxiety are experienced 
by the patient. The anxiety reaction patient 
has defenses that are only partially adequate 
against his fear of catastrophic breakdown and 
disaster to his personality structure. These de- 
fenses are not sufficient to control and suppress 
those fears even temporarily [9]. He is also 
egocentric in that he concerns himself, first of 
all, with his own affairs before his attention 
can be devoted to any tasks at hand [7]. 


The patient with an obsessive-compulsive re- 
action is characterized by uncontrollable and 
irrepressible tendencies to say, do or think 
something in a particular way [1]. These ten- 
dencies are said to have value as distractions 
and as tension-reducing mechanisms which 


1Miss C. Mildred Ceres, clinical psychologist, 
and Mr. Melvin Kornreich, clinical psychology 
trainee, Brooklyn VA Mental Hygiene Clinic, pro- 
vided considerable assistance in planning and car- 
rying through this investigation. Interpretations 
and conclusions do not reflect opinions of the Vet- 
erans Administration. 


may allay in some manner his anxiety [6]. It 
has been said that the patient is attempting to 
satisfy his need for safety and security through 
these obsessive-compulsive mechanisms [4]. 
The irrepressible tendency is reported to be 
more tolerable and less painful to the patient 
than the unpleasant memories or acts it is re- 
puted to replace [8]. By this process, it is be- 
lieved overt anxiety is moderately well avoided 
[9]. The anxiety is apparently under control 
as a result of the irrepressible tendencies acting 
as temporary shields against threats to the 
personality. 

It would appear that the degree of inner 
distress experienced by the anxiety reaction 
patient is greater than that of the obsessive- 
compulsive patient and that this difference in 
degree of inner distress would be manifested on 
a test such as the Rorschach. Finally, certain 
Rorschach responses would be associated with 
the specific adjustment patterns of the anxi- 
ety reaction patient in contrast with those Ror- 
schach responses that would be related to the 
particular modes of adjustment of the obses- 
sive-compulsive patient. 

Subjects. The subjects of this investigation 
were patients at a mental hygiene clinic of the 
Veterans Administration. Twenty-five patients 
exhibiting symptoms of an anxiety reaction 
and twenty-five patients with symptoms of an 
obsessive-compulsive disorder were selected and 
agreed upon by three psychologists as represent- 
ative of these reactions. They were chosen on 
the basis of symptoms alone, independent of 
test results and previous diagnostic classifica- 
tions. There was no significant difference be- 
tween the mean IQ and the mean chronologi- 
cal age of the two groups. 


Method. The Rorschach test was adminis- 
tered to the subjects and their Rorschach re- 
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sponses were evaluated by means of the Mun- 
roe Inspection Technique [5]. ‘The Munroe 
Inspection Check List consists of 28 categories 
embodying the major Rorschach scoring sym- 
bols. Specific numbers of entries were made for 
each response category according to the ex- 
tent of its deviation from predetermined clin- 
ically established normal limits. If within 
these normal limits, no entry was made. If out- 
side these normal limits, one, two, or three en- 
tries were made, dependent upon the extent 
of the deviation. The total score was the sum of 
the number of entries. A high score signified 
many signs of maladjustment and much inner 
tension as reflected in the Rorschach responses 
while a low score indicated few signs of Ror 
schach maladjustment and little inner tension. 

Results. The mean number of signs of Ror- 
schach maladjustment as indicated by the 
Munroe Inspection Technique was 10.0 for 
the obsessive-compulsive patient, with a stand- 
ard deviation of 3.6. For the anxiety reaction 
patient, the mean Munroe Inspection score was 
13.2 with a standard devi:tion of 4.0. The dif- 
ference between these mein scores was signifi- 
cant at the one per cent level. 

The mean number of the responses given to 
the Rorschach cards by the obsessive-compul- 


sive patient was 37.6 and the standard devia- 
tion was 18.9. The anxiety reaction patient’s 
mean number of responses was 18.8, standard 
deviation of 8.5. There was a significant dif- 
ference at the one per cent level between the 
means. 

Table 1 lists six Rorschach response cate- 
gories that differentiated significantly between 
the obsessive-compulsive and the anxiety reac- 
tion patient. The remaining response categor- 
ies of the Munroe Inspection Check List were 
nondiscriminating. 

Interpretation. The greater number of Ror- 
schach signs of maladjustment of the anxiety 
reaction patient may be attributed to his more 
persistent feelings of being under attack, of be- 
ing helpless and worthless. ‘The loss of the af- 
fection and esteem of others is gravely feared. 
The anxiety reaction patient does not have the 
defense mechanisms of the obsessive-compul- 
sive patient that are vigorous reactions against 
similar threatening and helpless feelings. ‘These 
unacceptable feelings are counteracted by the 
use of irrepressible tendencies which help re- 
duce inner distress in the obsessive-compulsive 
patient. This finding should not be construed 
that, given an equivalent degree of anxiety, 
the obsessive-compulsive patient, by utilizing 


TABLE 1 


DISCRIMINATIVE RORSCHACH ResPONSE CATEGORIES ON THE Muwnroe Inspection Cueck List 


Obsessive 


Rorschach Response Compulsive 
Category Patients 


). §—F 
Rejection of one or : 
more Rorschach Cards 


Rejection SS ee ae 3 

Non-Rejection....................-....-..- 22 
Rare Detail (dr) Responses 

Ee CI PU oevccereninaiccevesecntnctince 11 

10% or more....... . 14 
Exclusive Form (F) Responses 

ae 19 

A te 6 
Inanimate Movements (m) Responses 

Less than two............ 18 

(>) | ee 7 
Definite Form-Color (FC) Responses 

More than one... dali Ricindniates 15 

| Eee eer 10 
Color (FC ,CF, C) Responses 

More than two...................-..-..-- 14 

Cy a ee 11 








Anxiety Chi-Square Degree of Level of 
Reaction Freedom Significance 
Patients 
N95 
10 4.9 1 5% 
15 
19 5.6 1 2% 
6 
10 6.2 1 2% 
15 
24 4.2 1 5% 
1 
7 5.2 1 5% 
18 
7 4.02 1 5% 
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his defense mechanisms, will suffer less inner 
distress than the anxiety reaction patient. 

The greater number of responses of the ob- 
sessive-compulsive patient, in contrast to the 
anxiety reaction paticnt, may be explained by 
his greater need tc demonstrate his adequacy. 
On the other hand, the anxiety reaction pa- 
tient is so engrossed in himself and in his efforts 
to avert a catastrophic breakdown that he can- 
not devote himself to the task at hand. The 
task at hand is not the center of interest ; rather 
the necessity to arm himself against imminent 
breakdown is predominant. 

The anxiety reaction patient, by rejecting 
more Rorschach cards than the obsessive-com- 
pulsive patient, indicated that he adjusts to 
many situations by a process of rejection and 
withdrawal. On the other hand, the obsessive- 
compulsive patient may be said to adjust to 
many situations by a drive for completeness 
and quantity. 

The greater tendency of the obsessive-com- 
pulsive patient to give rare detail (dr) re- 
sponses, when contrasted with the anxiety re- 
action patient, reflected his meticulousness. 
There seemed to be a drive within the obses- 
sive-compulsive patient to arrange his world 
in an orderly fashion by taking into considera- 
tion the many minute details of his life. Fur- 
thermore, the obsessive-compulsive patient prob- 
ably feels that social value accrues by a mental 
approach focusing on minute details. Gold- 
farb’s investigation discovered a similar trend 
in the utilization of the rare detail (dr) re- 
sponse by the obsessive-compulsive adolescent 
[2]. 

The exclusive form (F') responses have been 
said to demonstrate the degree to which the 
individual was prone to control his more per- 
sonal or spontaneous reactions [3]. It would, 
therefore, appear that the anxiety reaction 
patient, more than the obsessive-compulsive 
patient, fears his spontaneity and therefore 
seeks refuge in the more formal aspects of be- 
havior. Not having sufficient defenses for the 
control of the fears aroused by his spontaneous 
reactions, the anxiety reaction patient resorts 
to an impersonal, matter-of-fact way of deal- 
ing with situations. The obsessive-compulsive 
patient, too, is fearful of spontaneity but not to 
the extent of the anxiety reaction patient. For 
he has erected some defenses that cope, even if 
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temporarily, with the anxiety provoked by spon- 
taneity. 

It may not, therefore, be true that obsessive- 
compulsive elements are present whenever the 
percentage of exclusive form (/') responses is 
fifty or more as Klopfer suggests [3]. The use 
of fifty per cent or more exclusive form (F) 
responses in the Rorschach test should perhaps 
be more generally interpreted as a need to be 
impersonal and matter-of-fact to avoid anxiety 
and possible acute breakdown. 

The inanimate movement (m)_ responses 
may be interpreted, as Klopfer suggests [3], 
as the experiencing by an individual of uncon- 
trollable inner forces working upon him. Their 
more frequent presence in the responses of the 
obsessive-compulsive than in those of the anx- 
iety reaction patient would support this inter- 
pretation. ‘he inanimate movement (m) re- 
sponses may be construed then as being indica- 
tive of obsessive-compulsive trends. It also ap- 
pears that much inner tension may be present 
without the appearance of inanimate movement 
(m) responses. 

The obsessive-compulsive patient’s capacity 
for emotional adjustment to outer reality is 
greater than that of the anxiety reaction pa- 
tient as shown in his greater use of form-color 
(FC) responses. The anxiety reaction patient 
fails to display much socially acceptable affect 
because he is so greatly concerned with himself 
and the anticipation of threat from every quar- 
ter. Furthermore, he is so fearful that emotion- 
al relations with others may result in addition- 
al anxiety that he limits the extent of his warm, 
spontaneous contacts. The same reason would 
apparently explain the lack of use of an ade- 
quate number of color (FC, CF, C) responses 
by the anxiety reaction patient when contrasted 
with the obsessive-compulsive patient. 

Summary. It was pointed out that the anxi- 
ety reaction patient had more persistent feel- 
ings of helplessness and worthlessness, which 
were manifested in a higher Rorschach mal- 
adjustment score. The obsessive-compulsive 
patient had more defense mechanisms which 
helped to reduce his feelings of inner distress, 
probably resulting in a lower Rorschach mal- 
adjustment score. The much larger number of 
Rorschach responses of the obsessive-compulsive 
patient, when contrasted with the anxiety re- 
action patient, may be due to his greater need 
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to demonstrate his adequacy and to offset feel- 
ing of inferiority. It may also be due to the in- 
ability of the anxiety reaction patient to get 
away from his own problems. Similar reasons 
were apparently associated with the greater 
number of rejections of Rorschach cards by the 
anxiety reaction patient when contrasted with 
the obsessive-compulsive patient. 

The obsessive-compulsive patient demon- 
strated a greater tendency to consider and mull 
over the minute and relatively unimportant de- 
tails of life. The anxiety reaction patient was 
more inclined to utilize impersonal, and mat- 
ter-of-fact ways of dealing with the environ- 
ment so as to limit his spontaneity. The obses- 
sive-compulsive patient, having defense mech- 
ansims, can be less formal and may maintain 
a socially desirable facade. It was suggested 
that the use of fifty per cent or more exclusive 
form (F) responses in the Rorschach test 
should be generally interpreted as a need to 
avert anxiety rather than as a sign of obsessive- 
compulsive traits. 

The use of two or more inanimate move- 
ment (m) responses was interpreted to signify 
the presence of obsessive-compulsive traits, 
since they were more frequently given by the 
obsessive-compulsive patient. It seemed that 
relatively great inner tension may prevail with- 
out the appearance of inanimate movement 
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(m) responses. It was also concluded that the 
anxiety reaction patient adjusts to many situa- 
tions by withdrawal and rejection while the ob 
sessive-compulsive patient adjusts by a drive 
for quantity and socially acceptable behavior. 


Received August 16, 1949. 
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THE SUSCEPTIBILITY TO FALSIFICATION OF THE 
RORSCHACH PSYCHODIAGNOSTIC TECHNIQUE 


ABRAHAM L. CARP anp ARTHUR R. SHAVZIN 


UNIVERSITY OF WISCONSIN 


N 1939, L. K. Frank [4] defined projective 

methods as a method for the study of per- 
sonality involving the presentation of a stimu- 
lus situation designed or chosen because it will 
mean to the subject not what the experimenter 
has arbitrarily decided it should mean (as in 
most psychological experiments using stand- 
ardized stimuli in order to be “objective’”’) but 
rather whatever it must mean to the personal- 
ity who gives it, or who imposes upon it his 
private, idiosyncratic meaning and organiza- 
tion. The purpose of projective techniques is 
no different than that of the inventory-type of 
personality tests, that is, to provide information 
needed for the understanding of an individual. 
However, in terms of approach to personality, 
projectives are quite different from other per- 
sonality tests. The method, in general, is to re- 
veal the total personality, or aspects of the per- 
sonality in their framework of the whole. 


Because of the characteristics of projective 
tests, it is dificult not only to interpret pro- 
jective responses quantitatively, but also to test 
their reliability statistically. Yet, because of 
their widespread use, reliability studies are of 
the utmost importance. Three techniques are 
commonly used to find the reliability of a test. 
They are the retest reliability, split-half relia- 
bility, and equivalent-form reliability. In gen- 
eral, only retest reliability can be used for pro- 
jective techniques because there usually are no 
equivalent forms of the test, and because it is 
difficult, if not impossible to split the test into 
equivalent halves. Further, the results of a pro- 
jective test depend upon the whole test. Con- 
sideration, however, must be taken of person- 
ality changes which may take place between 
test and retest as well as the effect of recall of 
previous responses on responses to the retest. 
(See Bell [1] fora discussion of this problem) 


PROBLEM 
One problem, which has ever been a nemesis 


to paper and pencil and other inventory-type 
tests is that of “faking,” i.e., responding with 
what the subject thinks will create the best im- 
pression, rather than the true one. Some tests 
attempt to eliminate this problem by techniques 
which will show a preponderance of “lies” in 
the responses. The Minnesota Multiphasic [7] 
for instance, has an L score, which, if too high, 
indicates that the results are not reliable. Al- 
though projective techniques are believed to 
have a strong point in that they reduce “‘fak- 
ing” to a minimum, there are few studies con- 
cerned with this problem. 

Two studies dealing directly with faking on 
the Rorschach have been reported by Fosberg 
[2, 3]. In his first study he gave the tests to 
two subjects under each of four different sets 
of instructions. First, they were given the 
standard instructions, then told to “make the 
best impression,” “the worst impression,” and 
finally, to look specifically for certain things 
which the experimenter asked to be sought, e. 
g., “Show me all the humans in action that 
you can find.” 

After scoring and tabulating the protocols 
in the usual way by the Klopfer method, the 
tabulations of each set of four were treated 
statistically in the following manner. Fosberg 
states: 


We first proposed to test the null-hypothesis: 
“All four records do not vary significantly from 
each other.” Stated another way, we can ask: “Are 
all four records no more than random deviations 
from a distribution in which the Rorschach cate- 
gories are in a fixed relationship to one another?” 
These queries stress the probability function of our 
statistical treatment. Chi-square is also a test uf 
“goodness of fit,” so we can also ask: “How closely 
do the given frequencies fit the theoretical distribu- 
tion of frequencies in the individual?” “—- 
To this end we calculated the chi-square for all 
four tests at once. 


Fosberg realized that combining Rorschach 
categories might be unjustified, but since the 
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chi-square test tends to exaggerate the discrep- 
ancy between observed and theoretical data 
when it is calculated with low theoretical fre- 
quencies, the only alternative was to omit cate- 
gories with such low frequencies. He finally 
decided, after consultation with Klopfer, to 
take both risks, that is, combine in some cases 
and keep low frequencies in others. His results 
for the uncombined categories from which chi- 
square was calculated for all four tests were 
probabilities of .950 and .975 for the two sub- 
jects. For the combined categories for all four 
tests, he got probabilities of .995 and .814, re- 
spectively. On this basis he concluded that the 
“Rorschach test does not lend itself to manipu- 
lations based upon the subjective personal es- 
timation of character, good, bad, or indiffer- 


ent but faithfully traces the more permanent 
picture.” 


In a later study, Fosberg used a correlation- 
al technique. He used 129 subjects, both adults 
and children, and of both sexes, which fell in- 
to the following groups: 

(1) Bernreuter “Control” Group—consisting of 
37 subjects who took the Bernreuter Inventory un- 
der the usual standard instructions, then to make 
the best impression, the worst impression, and final- 
ly under standard conditions once more. 

(2) Rorschach “Control” Group—consisting of 
26 subjects reported in the literature. Each of these 
subjects had had two or more protocols published. 

(3) Main Experimental Group — consisting of 
25 males and 25 females, professors, graduate and 
undergraduate students of the psychology depart- 
ments at New York University and Columbia Uni- 
versity. They all took the test under the conditions 
mentioned for the Bernreuter Group above. 

(4) Special Experimental Group—consisting of 
16 subjects who took the Rorschach test as de- 
scribed for the main Experimental Group, except 
that in the last test, the Rorschach determinants 
were all pointed out to these subjects and they 
were asked to see as many as possible. 


Correlations were calculated for all the 
groups between tests and retests. In the case 
of the Rorschach groups, the Rorschach Sec- 
tions, “Location,” “Determinants,” and “Con- 
tents,” were correlated separately as well as 
the test as a whole. Fosberg concluded that 
“The Rorschach withstood all the attempts at 
manipulation by the subjects in the two di- 
rections of making better or worse impressions 
of their personalities, whereas the Bernreuter 
Inventory could not withstand such manipula- 
tion.” 


Since Fosberg’s studies are the only ones 
which are concerned with “faking” a Ror- 
schach, it is important that his results be veri- 
fied, especially so since the correlations he ob- 
tained in his second study are as high, or high- 
er, than those obtained in reliability studies of 
most personality and even intelligence tests 
(the correlation coefhcients ranged from .828 
to .944). The necessity of further work is ob- 
vious. In his second study he used a larger 
number of subjects but used a correlation co- 
efficient to compare the tests. He does not state, 
however, which type of correlation he used." 
It is dificult to see how one correlation coefh- 
cient could be applied to the whole test since 
the Rorschach categories cannot be considered 
as class intervals. Also, in using the distribution 
of scores as equally related to the whole test, a 
difference of three where vhe responses in the 
category range from zero to three would be the 
same as a difference of three where the range 
is from 17 to 20. Further, he does not state 
which figures he used in making his computa- 
tions. That is, it is not known from the art- 
icle whether he calculated the correlation for 
each individual and determined the mean or 
whether he lumped the responses of each cate- 


gory together to obtain his correlation coeffi- 
cients. 


This study, therefore, was designed to test 


further the susceptibility of the Rorschach to 
falsification. 


PROCEDURE 


In the present study, the Rorschach test was 
given twice, three weeks apart, to each of 20 
male elementary psychology students. The sub- 
jects were asked to imagine themselves in cer- 
tain situations while taking the test: once, to 
give a “good” impression, and once, to give a 


“bad” impression. The directions were as fol- 
lows: 


This test consists of a series of 10 cards which 
are designs made up out of ink blots. People see 
all sorts of things in these ink blots; now tell me 
what you see, what it might be for you, what it 
makes you think of. However, I want you to 
imagine yourself in this situation. 

Situation I: (for bad impression) you 
are being drafted for the army and want very 
much to stay out. The results of this personality 

1Personal communication from Dr. Fosberg 


states that he used the product-moment coefficient 
of correlation. 
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test will determine whether or not you will be 
accepted. An unfavorable report will keep you out. 

Situation II: (for good impression) ... . you 
are in a hospital for the mentally ill and, nat- 
urally, want to get out. You are being given this 
personality test to see whether or not you can be 
released. A favorable report will get you that 
release. 

Now let’s go through the test with these cir- 
cumstances in mind. What do you see in the first 
card? 

One half of the subjects were presented with 
Situation I first, the other half were presented 
with Situation II first. The protocols were 
scored and tabulated in the usual way by the 
Beck system. 

Each category of the “good” and “bad” tests 
was then compared by the ¢ test and correla- 
tion coefficient. Where the number of responses 
in a category was too small to obtain a reliable 
Pearson product-moment correlation, a tetra- 
choric correlation was calculated. 

The problem of not enough responses in 
some categories was solved by combining cer- 
tain categories which are similar. Thus, all 
light responses (VY, Y, FV, FY, VF, YF) 
were combined, as were human responses (H 
and Hd), animal responses (4 and Ad). Color 
responses were compared, both individually 
and as awhole. Ct referring to total color was 
calculated by weighting the color value as Beck 
does in calculating Experience Balance, i.e., 
C, 1.5; CF, 1.0; FC, 0.5. 

The next step was to compare the whole 
tests, both “good” and “bad,” as units, by cal- 
culating the chi-square and probability of ho- 
mogeneity. Here too, categories were combin- 
ed; these, in addition to those mentioned above, 
being the form determinants (F+, F—, and 
F), color (this time each color response as a 
unit), and a miscellaneous category consisting 
of all content responses other than human or 
animal responses. When combining the categor- 
ies still did not provide enough responses for 
reliable calculations the category was omitted. 


RESULTS 

Table 1 shows the results of comparing each 
category by ¢ test and correlation. The ¢ test 
was used to see if a change in response, due to 
the conditions of this experiment, would have 
direction, i.e., would trying to make either a 
good or bad impression increase or decrease 
certain or all responses. It is seen that only in 


Z (organization) is the deviation from the 
mean significant, and that at the 5 per cent c.l. 
This does not mean, however, that no changes 
were produced. The data clearly showed the 
differences. But the direction taken was so di- 
verse, among the individual subjects, that they 
were balanced out in the analysis. ‘Thus it is 
not possible to predict in what way a subject 
will respond in his attempt to “fake’’ the results. 

Five of the correlation coefficients are above 
.85. Three of them pertain to form (F+, F—, 
and F’), one to time per response (T/R), and 
one to the number of responses per test (R). 
The remaining correlations ranged from .16 
to .84. According to Rorschach theory, Content 
is most subject to manipulation, but since it is 
the least important of the Rorschach sections 
for personality interpretation, this lack of con- 
sistency is of no importance. In this study, how- 
ever, the Content responses of human and ani- 
mal categories had correlations of .80 and .82 
respectively, while Wholes (4), Movement 
(M), Color-form (CF), and Total Color 
(Ct) had correlations of .16, .17, .21, and .33 
respectively. In general, then, it must be con- 
cluded that the total responses of all the sub- 
jects for each category of the “good” test do not 
show a close relationship to the total responses 
of all the subjects for the same category of the 
“bad” test. These test-retest reliabilities are 
somewhat lower than reported for these cate- 
gories when there is no attempt to distort the 
personality picture [5, 6]. 

















TABLE 1 
COMPARISON OF Test CATEGORIES 
(N= 20) 
CATEGORY t r o@ CATEGORY ¢t r a 
-W 1.00 .16 .223 Ct 1.27 .83 .856 
-D 1.07 .91 .079 H+Hd 1.82 .80 .083 
Dd 1.01 .84 .068 A+Ad 1.00 .82 .075 
Light 1.00 .64 .135 R 1.17 .90 .944 
F plus 1.06 .95 .022 Z 2.32** 68 .123 
Fminus 1.43 .97* .415 F+% 1.00 .59 .150 
F 1.00 .88 .052 A % 1.00 .46 .181 
M 1.00 .17* .878 P 1.42 .55 .160 
Cc 1.00 .65* .391 T/IR 1.00 .56 .157 
CF 1.00 .21* .382 T/R 1.00 -93 .031 
FC 1.00 .48* .435 Ss 1.53 


.69* .361 
*Tetrachoric correlation a anaes pa ama 
**Significant at the 5 per cent confidence level. 


To compare each subject with himself, a chi- 
square technique was used. It was assumed that 
each Rorschach category is an independent 
measure, thus the number of responses in each 
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category could be considered as the frequency. 
Chi-squares were calculated to determine if the 
two distributions from each subject were simi- 
lar. Table 2 shows the results of this analysis. 
At the five per cent level of confidence the dis- 
tribution differed significantly for three sub- 
jects and at the ten per cent level for two more 
subjects. It may be assumed, therefore, that un- 
der the conditions of this experiment, there 
were some subjects who were successful in pre- 
senting two different response patterns on the 
Rorschach. 


TABLE 2 
Decree OF HomMoceNnetry oF “Goop” 
AND “Bap” Tests 


SUBJECT « df P SUBJECT *@ df P 


I 1.506 6 .95-.98 XI 13.196 9 .10-.20 

II 7.840 9 .50-.79 XII 2.392 7 .90-.95 
Ill 16.833 9 .05-.10 XIII 5.008 8 .70-.80 
IV 8.935 7 .20-.30 XIV 12.604 6 .02-.95 
Vv 7.997 7 .30-.50 XV 8.527 9 .50-.70 
VI 13.834 7 .05-.10 XVI 1.678 9 .99 
VII 8.866 7 .80-.90 XVII 8.315 56 .50-.70 
Vill 4.529 8 .80-.90 XVIII 15.584 6 .01-.02 
IX 4.651 9 .80-.90 XIX 1.702 6 .90-.95 


x 6.006 5 .30-.50 XxX 28.744 8 O01 


DISCUSSION 


The results of this study suggest that it is 
possible for some subjects to distort their Ror- 
schach protocols depending on the set to make 
a “good” or “bad” impression. It appears that 
subjects’ desire to “fake” must be considered. 
There is still, of course, the possibility that the 
clinician would have no difficulty in recogniz- 
ing the “faked” test. Also there has been no 
attempt to determine whether the subjects 
were “successful” in altering their responses 
in the desired directions. But the implication 
that these factors need not be considered is not 
supported. 

It should be pointed out also that the use of 
the chi-square technique to ascertain the simi- 
larity of the two tests is not completely justi- 
fied. The test implies that each category is in- 
dependent, that is, that there is, for example, 
no relation between W and M. This assump- 
tion is probably not tenable. The method was 
employed, however, to see how closely these re- 
sults would approximate those obtained by 
Fosberg using a similar technique. A further 
study is in progress to determine whether by a 
rating and matching technique judges can dif- 
ferentiate between simulated tests. 


SUMMARY 


Twenty subjects were given the Rorschach 
twice under conditions designed so that they 
would give a “good” impression in one instance 
and a “bad” impression in the other. A ¢ test 
comparison of each category in the two tests 
showed no group differences although there 
were wide individual differences in the direc- 
tion of the responses in a subject’s attempt to 
manipulate his responses. Coefficient correla- 
tions for separate scoring categories ranged 
from .16 to .97, suggesting that there is not a 
close relationship between certain categories of 
the “good” test and the categories of the “bad”’ 
test. 

The two tests for each subject were then 
compared as units by means of chi-square to 
see whether the two distributions were similar. 
For four subjects the probabilities were less 
than .10 that the two distributions came from 
the same population. It is seen, therefore, that 
there were some subjects who were able to alter 
their responses. 


In conclusion, the results of this study do not 
agree with those obtained by Fosberg, which, 
he states, indicate that “the Rorschach with- 
stood all attempts at manipulation by the sub- 
jects.”” On the contrary, this study shows that 
there are some subjects who can manipulate 
their responses, who can vary their personality 
picture as reflected by the Rorschach, under in- 
structions to make “good” or “bad” impres- 
sions. 
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A NOTE ON THE VALIDITY OF THE LEITER 
INTERNATIONAL PERFORMANCE SCALE 


TRENT E. BESSENT 


NORWALK STATE HOSPITAL, CALIFORNIA 


HE Leiter International Performance 

Scale, hereafter referred to as the LIPS, 
was published in 1940 [1] as a year scale 
similar in design to that of the Binet but 
based solely upon performance items. During 
the war the scale was not produced, but recent- 
ly an announcement has been made of its avail- 
ability [2]. 

Procedure. Over a period of a year and a 
half data were gathered on twenty cases who 
were tested by both the Revised Stanford 
Binet (Form L) [3] and the LIPS (1940 ed- 
ition) in the clinic of the court psychologist 
for Santa Barbara County, California. ‘This 
sample was composed of thirteen male and 
seven female court cases, with fifty per cent be- 
ing of Mexican ancestry. In chronological age 
they ranged from four years to fifty-eight 
years with the median age being about eleven. 
The middle fifty per cent ranged from about 
nine years to sixteen years of age. By referring 
to Table 1 it can be seen that the group as a 
whole scored rather low on the two tests with 
mean IQ’s for both the Binet and the LIPS of 
about 70, and mean MA’s of a little over seven 
years. It should also be noted that this group 
was more variable on the Binet than the 
standardizing population used on that test, 


the SD being over 22 instead of 16. 


TABLE 1 

SUMMARY OF STATISTICAL DATA ON THE REVISED 
STANFORD BINET AND THE LeITeR INTER- 

NATIONAL PERFORMANCE SCALE 








Binet (L)  —S Leiter 





Pearson 
Measure N M SD M SD r 
1Q 20 «469.85 22.61 70.55 20.65 .92+.025 
MA 20 88.5 36.3 86.0 388.3 .98+.0%1 
(months) 





Results. Obviously, because of the wide 
chronological age range and the greater varia- 
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bility of the group, some of the statistical find- 
ings will be spuriously large, but yet of general 
interest value. By referring to Table 1 the 
reader will find that the correlation between 
the IQ’s of the two tests is .92. Since the IQ 
is an index figure, this introduces a spurious 
index correlation as some of the positive corre- 
lation is a result of the large CA range. How- 
ever, when we eliminate the statistical problem 
of the index and consider the test scores in the 
form of the MA we find that the correlation 
is still a high .93. As the CA and the variability 
of the subjects have also influenced this corre- 
lation, an attempt was made to hold constant 
the CA range. In the total sample of twenty 
cases, six were found at the CA level of nine 
years. The Spearman rho, when comparing the 
MA scores of these six cases was .99. 

In addition it is noted that the scores yielded 
by the LIPS appear to be similar in magnitude 
to those of the Binet. As indicated in Table 1 
the mean IQ’s for both tests were about 70, 
and statistically there were no significant dif- 
ferences in the means and sigmas for both the 
IQ and MA scores for the two tests. 


Received July 21, 1949. 
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BALes, Rosert F. Interaction process analysis. Cam- 
bridge, Mass.: Addison-Wesley Press, 1950. Pp. 
xi + 203. $6.00. 


A detailed description of the methods used by the 
Laboratory of Social Relations at Harvard Univer- 
sity for the recording and analysis of the interac- 
tions in small groups. Itself a product of interdiscip- 
linary contacts between sociology and psychology, 
the method of analysis is one of the most promising 
tools for the advancement of the study of many in- 
terpersonal areas: psychotherapy, education, social 
action, and politics. No recent development has 
shown more clearly the rewards that follow the 
quantification of observations that previously have 
been made only qualitatively and crudely. 


BRYNGELSON, Brync, CHAPMAN, MyFANwy E., AND 
HANSEN, Orvetta K. Know yourself: A work- 
book for those who stutter. (Rev. Ed.) Minneap- 
olis: Burgess, 1950. Pp. iii + 159. $2.00. 


Know yourself is a workbook plarned for use by 
a trained speech clinician for work with those who 
stutter. The authors have arranged the material to 
give students of junior and senior high school age a 
working knowledge of mental responses to situations. 
They hope to give the student an objective attitude 
towards all his behavior responses, while working 
on the control and correction of his poor speech 
habits. Some of the lessons and the plan of attack 
will be of special interest to all who help students 
of that age level, such as group leaders and parents. 
Since many problems will be opened up, care should 
be taken that the lessons are presented by trained 
personnel. Serious difficulties might arise if problems 
are uncovered without the presence of a leader who 
is able either to give help or to direct the student 
to someone who can resolve his problem.—B.M.L. 


Encte, T. L. Psychology. (Rev. Ed.) Yonkers, N. Y.: 
World Book Co., 1950. Pp. xi + 628. 


A new edition of an elementary survey of applied 
psychology, for use in senior high schools and junior 
colleges. While much of the text is devoted to per- 
sonality and personal adjustment, the selection and 
presentation of material often fails to achieve maxi- 





Nore: Some reviews in this issue were prepared by 
the Associate Editors, who may be identified by their 
initials. Unsigned reviews are by the Editor. —L.F.S. 
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mum effectiveness for the intended level of student; 
witness the page and a half devoted to superficial 
details about differences between individual, group, 
and multiple-choice Rorschach. 


FeLpenkraAis, M. Body and mature behavior. New 
York: International Universities Press, 1949. Pp. 
viii + 167. $3.75. 


The perplexing subtitle of this small book, “A 
Study of Anxiety, Sex, Gravitation, and Learning,” 
may seem like a cross-out-one-wrong-word test item 
until the author’s thesis is disclosed. Feldenkrais ex- 
amines the implications of the erect human posture, 
and of various modes of carriage and movement, for 
the development of personality and for psychothera- 
py. The theory has its points of interest, but does 
not quite persuade. 


Harsu, CHARLES M., AND ScurickeL, H. G. Person 
ality development and assessment. New York: 
Ronald, 1950. Pp. vii + 518. $5.00. 


This book is intended as a text for courses in the 
development and assessment of personality, or of 
personology, the authors having adopted Murray’s 
term. Part I considers the problems of individuality 
as they arise in the developmental sequence from 
conception to old age. A series of chapters is concern- 
ed with the successive age levels, and in each in- 
stance the discussion covers the factors influencing 
personality; the major problems and mechanisms of 
adjustment; changes in motivation, ability and 
learned adjustment patterns; the significance of the 
phase for later developments; and characteristic in- 
dividual variations. There is also a chapter on ad- 
justments to love and marriage in early adulthood. 
Part II, on personality study in theory and practice, 
includes chapters on theory and on assessment. There 
is an appendix on theories of trait organization. The 
treatment of the various techniques for assessing and 
interpreting personality is necessarily sketchy, but a 
fair overall picture is given. The book will probab- 
ly make a satisfactory text, although it displays the 
disadvantages as well as the advantages of a gen- 
eral eclecticism.—A.R. 


Hymes, JAmes L., Jr. Teacher listen, the children 
speak. New York: New York Committee on Men- 
tal Hygiene of the State Charities Aid Associa- 
tion, 105 E. 22 St., 1949. Pp. 44. 25¢. 

An exceptionally able and attractive pamphlet, 
designed to help teachers understand troublesome 


235 








236 NEW BOOKS AND TESTS 


children. The overaggressive child, the attention- 
getting child, the dependent child, the withdrawing 
child, and the “nervous” child are crying out their 
need for real understanding and help. 


Kusie, Lawrence S. Practical and theoretical aspects 
of psychoanalysis. New York: International Uni- 
versities Press, 1950. Pp. xvii + 252. $4.00. 


This is a revision and expansion of a book origin- 
ally published in 1936 and at that time directed to- 
ward an essentially lay audience. The present re- 
vision attempts “to present not only to laymen, but 
also to physicians, and to all students of psychiatry 
and psychology, certain theoretica! as well as practic- 
al aspects of psychoanalysis, with some indication 
of the directions in which it may be expected to de- 
velop in the future.” Your reviewer found it clear, 
well developed, and excellently written. It can be 
read with profit by all students of clinical psychol- 
ogy. — W.A.H. 


LeMKAU, Paut V. Mental hygiene in public health. 
New York: McGraw-Hill, 1949. Pp. xiv + 396. 
$4.50. 


This book grew out of the author’s experience in 
teaching mental hygiene to students in the School of 
Hygiene and Public Health at Johns Hopkins Uni- 
versity. As such, its audience is primarily profes- 
sional people who have not specialized in mental 
health problems. Part I, The Place of Mental Hy- 
giene in Public Health, is a well documented ar- 
gument for the necessity of integrating mental hy- 
giene programs in the work of federal, state and lo- 
cal public health agencies. Part II, The Develop- 
ment of the Individual, is a scholarly but easy-to- 
read summary of personality growth and develop- 
ment from birth to senility. The approach here is ec- 
lectic with emphasis on the possibilities for the mod- 
ification of growth in the interests of better mental 
health. A forty-page appendix is devoted to brief 
description of the major forms of psychopathology. 
Another appendix lists appropriate visual aids for 
use in mental hygiene programs. —E.L.K. 


MEERLOO, Joost A. M. Patterns of panic. New York: 
International Universities Press, 1950. Pp. 120. 
$2.00. 


From his first-hand experiences in The Nether- 
lands during World War II, the author describes 
and discusses the nature and causes of civilian and 
military panic. The approach is largely anecdotal, 
and provides a store of concrete illustrations perhaps 
unsurpassed by any other recent work. There is oc- 
casional reference to psychoanalytic formulations, 
but the book is not at its best when dealing with 
theory. The prescription for preventing panic—by 
preparation, information, and action-—is a common- 
sense one, but soundly based on experience. 


Retwatp, Paut Society and its criminals. New York: 
International Universities Press, 1950. Pp. xix + 
315. $4.50. 

This is a psychoanalytically oriented discussion of 


the criminal as an outgrowth of society, and of the 
motivations of the social group that inflicts the pun- 
ishment. The role of unconscious drives in criminal 
law and the need to safeguard society intelligently 
from crime in phantasy, and the criminal, are all 
analyzed in separate chapters. The book attempts to 
contribute to a deeper understanding of what crime 
means to us and how this affects our treatment of it. 
—F, McK. 


Secuin, C. ALBerto Introduction to psychosomatic 
medicine. New York: International Universities 
Press, 1950. Pp. 320. $5.00. 

The comprehensiveness of this brief introduction 
to psychosomatic medicine results in its being over- 
ly terse at times, as it occasionally descends to 
what is little more than outline form. Consequently 
the selection and treatment of material is somewhat 
uneven. There are six pages devoted to the anatomy 
of the hypothalamus and only three to its physi- 
ology; while sixty pages (of the total three hun- 
dred) are allotted to the taking of the psychosomatic 
clinical history. This unevenness detracts from the 
value of the book as an all-around text, but it is 
essentially sound, and could be used with value as 
supplementary reading at the introductory level. 
Even the experienced worker will find much that is 
stimulating, as in the author’s brief but penetrating 
discussion of the concepts of “disease” and “cure.” 


—W AH. 


Smart, Mouiie Stevens Babe in a house. New York: 
Scribner’s, 1950. Pp. 212. $2.75. 


A really good popular book about babies and their 
families, that psychologists can recommend safely to 
their layman friends. Mainly by easy-reading case 
studies, it gives many of the insights that a normal 
but perplexed parent needs. Mrs. Smart not only rec- 
ognizes, but communicates effectively, that the same 
“method” will not be best for all babies, nor for all 
parents. She avoids the middle-of-the-road confu- 
sions that often try to pass for moderation, and pre- 
sents a sound reasonableness that is based on a good 
knowledge of modern psychological research and 
theory, and a sensitive understanding of direct ex- 
perience. 


SONNEMANN, ULricH Handwriting analysis as a psy- 
chodiagnostic tool. New York: Grune & Stratton, 
1950. Pp. x + 276. $5.00. 


Why has graphology so long remained a rejected 
stepchild of psychology in America, while receiving 
more recognition in Europe? A thoughtful study of 
Sonnemann’s book perhaps gives an answer. Graph- 
ology, at least the “systematic” graphology of Klages 
that is expounded here, is content to describe the 
character of a person in terms of the character 
of his handwriting. The test and the referent are 
one and the same, whereas the empirical culture 
of American psychology demands an_ external 
referent. Illustrations are easily found in Son- 
nemann. In the chapter “Twelve Lines from 
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Joan,” illustrating the application of his anal- 
ysis, are many phrases such as these: “it seems 
to caress and cultivate rather than neglect the letter 
forms. ...,” “the quantitative extension of the move- 
ments is well measured; it never gets involved in 
‘crippling’ over-scrupulous detail. .” American 
psychology wants to know something more. Does 
Joan “caress and cultivate” her children and her 
friends? Does she “never get involved in over- 
scrupulous detail” in her home and in her work? 
Such questions remain unanswered. It will be no 
solution to regress to atomistic letter-measuring in 
order to be objective and quantitative, for, as Sonne- 
mann rightly says, that approach has been unre- 
warding. However, newer global approaches being 
developed for the validation of projective techniques 
may well prove applicable to graphology also. Mean- 
while, clinical psychology may withhold judgment, 
neither rejecting graphology as quackery, nor es- 
pousing it too warmly, until the further evidence be- 
comes available. 


Steiner, M. E. The psychologist in industry. Spring- 
field, Ull.: Charles C Thomas, 1949. Pp. vii + 
107. $2.00. 


A lecture-in-print on the roles of the industrial 
psychologist in selecting and placing workers, in 
working with them on the job, and in counseling 
them. For so small a volume, the summaries of re- 
search are surprisingly full. The digest of the appli- 
cations of the Rorschach to industrial situations is 
especially complete, although sometimes uncritical. 
The newer responsibility of the psychologist as a 
personal counselor in industry is recognized ade- 
quately. 


Sreckie, Lynpe C. Problems of human adjustment. 
New York: Harper, 1949. Pp. x + 351. $3.00. 


The author states that in this work he attempts 
“to introduce man to himself.” Through generous 
use of examples he develops the theory that “the 
health of an individual is determined by the reaction 
between inner necessity and external demands.” Ap- 
plication of his thesis is discussed in terms of the in- 
dividual’s adjustment to five areas of adult life: 
sexual adjustment, courtship and marriage, work, 
religion and old age. Though the author states 
clearly that adult behavior is learned behavior, he 
places greater emphasis on the effect in adult life 
of adaptive and nonadaptive behavior than on con- 
ditions in early life creating these responses. The 
style and subject matter of this book are personalized 
and directed toward stimulation of self-appraisal. 


Trencorp, R. F. Human relations in modern industry. 
New York: International Universities Press, 1950. 
Pp. 192. $2.50. 


This volume consists, in the main, of a series of 
lectures delivered at a course in human relations in 
industry at Roffey Park Rehabilitation Center, Lon- 
don, England. The lectures were originally designed 
as stimuli to discussion for the participants of the 
course—psychiatrists, industrial physicians, nurses, 


personnel officers, psychiatric social workers, a union 
leader, and others. Because of the mixed audience 
for whom these lectures were designed, it may be 
unfair to assess the contents at face value, but the 
fact remains that psychologists and trained personnel 
workers will find the treatment too elementary. Psy- 
chiatric principles are stated in oversimplified terms, 
and principles of “human relations” are in terms of 
nonpsychiatric capsules. Those not oriented in mod- 
ern psychology may find the mixture of psychological 
fragments, inspirational examples, and homespun 
quotations interesting. —M.K. 


1950 Directory of Vocational Counseling Services. 
Ethical Practices Committee, National Vocational 
Guidance Association. St. Louis 5, Mo.: Ethical 
Practices Com., Box 64, Washington Univ., 1950. 
Pp. 98. $1.00. 


“Sooner or later every profession that wins public 
acceptance faces the problem of policing its own 
practitioners, and helping the lay public to identify 
the honest workman,” states the president of the N. 
V.G.A. in his foreword to this important pamphlet. 
Culminating continuous efforts since 1921, the Di- 
rectory indicates that the profession of vocational 
guidance has indeed reached a high degree of ma- 
turity. The minimum standards for vocational guid- 
ance service are described explicitly in an introduc- 
tory section, in terms of recognition, personnel, pro- 
cedures, advertising, and fees. The body of the 
pamphlet gives a geographical listing of 131 agencies 
and individual practitioners, each of which was in- 
vestigated by questionnaire, by references, and by a 
visit. Listing is voluntary, and only agencies which 
apply are studied. For each agency, the sponsorship, 
services, clientele, fees, method of application, di- 
rector, staff, and hours of service are stated. Psycholo- 
gists throughout the country will find the pamphlet 
an invaluable guide to comunity resources. 


TESTS 


Ammons, Rosert B., Butter, MARGARET N. AND 
Herzic, SAM A. Vocational Apperception Test. 
High school-college-adult. Individual test. 18 
plates. (10 for men, 8 for women), with manual, 
pp. 7 ($7.25). Louisville, Ky.: R. B. Ammons, Un- 
iv. of Louisville, 1949. 

An experimental edition of a set of pictures, each 
representing one occupation. Stories, obtained and 
analyzed in a manner similar to the TAT, are in- 
tended to evoke expressions. of occupational prefer- 
ence, reason for entering an occupation, personal 
needs and mechanisms, and outcomes. The manual 
gives only meager data concerning results. 


Lancmurr, CHARLES R. Oral Directions Test. Re- 
vised manual, pp. 8. New York: Psychological 
Corp., 1950. 


The revised manual for this phonograph-adminis- 
tered ability test gives additional normative data 
for children, veterans, and industrial workers, and 
includes some findings on the validity of the method 
for industrial selection, 
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